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THERAPEUTIC AND DIAGNOSTIC METHODS AND COMPOSITIONS 
BASED ON NOTCH PROTEINS AND NUCLEIC ACIDS 



This application is a continuation-in-part of copending application 
5 Serial No. 08/083,590 filed June 25, 1993, which is a continuation-in-part of both 
application Serial No. 07/955,012 filed September 30, 1992, now abandoned, and 
copending application Serial No. 07/879,038 filed April 30, 1992, each of which 
is incorporated by reference herein in its entirety. 

This invention was made in part with government support under 
10 grant numbers GM 29093 and NS 26084 awarded by the National Institutes of 
Health. The government has certain rights in the invention. 

1. INTRODUCTION 
The present invention relates to therapeutic compositions 
comprising Notch proteins, analogs and derivatives thereof, antibodies thereto, 
nucleic acids encoding the Notch proteins, derivatives or analogs, Notch antisense 
nucleic acids, and toporythmic proteins which bind to Notch and their nucleic 
acids and antibodies. Therapeutic and diagnostic methods are also provided. 
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2. BACKGROUND OF THE INVENTION 



2.1. THE NOTCH GENE AND PROTEIN 
Null mutations in any one of the zygotic neurogenic loci - Notch 
(N), Delta (DI), mastermind (mam ). Enhancer of S^lit (EfspH . neuralized (Dsn), 

2 5 and big brain (bib) -result in hypertrophy of the nervous system at the expense of 
ventral and lateral epidermal structures. This effect is due to the misrouting of 
epidermal precursor cells into a neuronal pathway, and implies that neurogenic 
gene function is necessary to divert cells within the neurogenic region from a 
neuronal fate to an epithelial fate. Studies that assessed the effects of laser 

30 ablation of specific embryonic neuroblasts in grasshoppers (Doe and Goodman 
1985, Dev. Biol. Ill, 206-219) have shown that cellular interactions between 
neuroblasts and the surrounding accessory cells serve to inhibit these accessory 
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cells from adopting a neuroblast fate. Together, these genetic and developmental 
observations have led to the hypothesis that the protein products of the neurogenic 
loci function as components of a cellular interaction mechanism necessary for I 
proper epidermal development (Artavanis-Tsakonas, 1988, Trends Genet. 4, 95- r 
5 100). 

Sequence analyses (Wharton et at., 1985, Cell 43, 567-581; Kidd 
et al., 1986, Mol. Cell. Biol. 6, 3094-3108; Vassin et al., 1987, EMBO J. 6, 
3431-3440; Kopczynski et al., 1988, Genes Dev. 2, 1723-1735) have shown that 
two of the neurogenic loci, Notch and Delta , appear to encode transmembrane 

10 proteins that span the membrane a single time. The Drosophila Notch gene 
encodes a -300 kd protein (we use "Notch" to denote this protein) with a large 
N-terminal extracellular domain that includes 36 epidermal growth factor (EGF)- 
like tandem repeats followed by three other cysteine-rich repeats, designated 
Notch /lin-12 repeats (Wharton et al., 1985, Cell 43, 567-581; Kidd et al., 1986, 

15 Mol. Cell Biol. 6, 3094-3108; Yochem et al., 1988, Nature 335, 547-550). The 
sequences of Xenopus (Coffman et al., 1990, Science 249:1438-1441) and a 
human Notch homolog termed TAN-1 (Ellisen et al., 1991, Cell 66:649-661) have 
also been reported. Delta encodes a - 100 kd protein (we use "Delta" to denote 
DLZM, the protein product of the predominant zygotic and maternal transcripts; 

20 Kopczynski et al., 1988, Genes Dev. 2, 1723-1735) that has nine EGF-like 
repeats within its extracellular domain (Vassin et al., 1987, EMBO J. 6, 3431- 
3440; Kopczynski et al., 1988, Genes Dev. 2, 1723-1735). Although little is 
known about the functional significance of these repeats, the EGF-like motif has 
been found in a variety of proteins, including those involved in the blood clotting 

25 cascade (Furie and Furie, 1988, Cell 53, 505-518). In particular, this motif has 
been found in extracellular proteins such as the blood clotting factors IX and X 
(Rees et al., 1988, EMBO J. 7, 2053-2061; Furie and Furie, 1988, Cell 53, 505- 
518), in other Drosophila genes (Knust et al., 1987, EMBO J. 761-766; Romberg 
et al., 1988, Cell 55, 1047-1059), and in some cell-surface receptor proteins, 

30 such as thrombomodulin (Suzuki et al., 1987, EMBO J. 6, 1891-1897) and LDL 
receptor (Sudhof et al., 1985, Science 228, 815-822). A protein binding site has 
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been mapped to the EGF repeat domain in thrombomodulin and urokinase 
(Kurosawa et al., 1988, J. Biol. Chem 263, 5993-5996; Appella et al., 1987, J. 
Biol. Chem. 262, 4437-4440). 

An intriguing array of interactions between Notch and Delta 

5 mutations has been described (Vassin, et al., 1985, J. Neurogenet. 2, 291-308; 
Shepard et al., 1989, Genetics 122, 429-438; Xu et al., 1990, Genes Dev., 4, 
464-475). A number of genetic studies (summarized in Alton et al. , 1989, Dev. 
Genet. 10, 261-272) has indicated that the gene dosages of Notch and Delta in 
relation to one another are crucial for normal development. A 50% reduction in 

10 the dose of Delta in a wild-type Notch background causes a broadening of the 
wing veins creating a "delta" at the base (Lindsley and Grell, 1968, Publication 
Number 627, Washington, D.C., Carnegie Institute of Washington). A similar 
phenotype is caused by a 50% increase in the dose of Notch in a wild-type Delta 
background (a "Confluens" phenotype; Welshons, 1965, Science 150, 1 122- 

15 1129). This Delta phenotype is partially suppressed by a reduction in the Notch 
dosage. Work has shown that lethal interactions between alleles that correlate 
with alterations in the EGF-like repeats in Notch can be rescued by reducing the 
dose of Delta (Xu et al., 1990, Genes Dev. 4, 464-475). Xu et al. (1990, Genes 
Dev. 4, 464-475) found that null mutations at either Delta or mam suppress lethal 

20 interactions between heterozygous combinations of certain Notch alleles, known 
as the Abruptex (Ax) mutations. Ax alleles are associated with missense 
mutations within the EGF-like repeats of the Notch extracellular domain (Kelley 
et al., 1987, Cell 51, 539-548; Hartley et al., 1987, EMBO J. 6, 3407-3417). 

Recent studies have shown that Notch and Delta, and Notch and 

25 Serrate, directly interact on the molecular level (Fehon et al., 1990, Cell 61:523- 
534; Rebay et al., 1991, Cell 67:687-699). 

Notch is expressed on axonal processes during the outgrowth of 
embryonic neurons (Johansen et al., 1989, J. Cell Biol. 109:2427-2440; Kidd et 
al., 1989, Genes Dev. 3:1113-1129; Fehon et al., 1991, J. Cell Biol. 

30 113:657-669). 
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A study has shown that certain Ax alleles of Notch can severely 
alter axon pathfinding during sensory neural outgrowth in the imaginal discs, 
although it is not yet known whether aberrant Notch expression in the axon itself 
or the epithelium along which it grows is responsible for this defect (Palka et al., 
5 1990, Development 109, 167-175). 

2.2. CANCER 
A neoplasm, or tumor, is a neoplastic mass resulting from 
abnormal uncontrolled cell growth, which may cause swelling on the body 
10 surface, and which can be benign or malignant. Benign tumors generally remain 
localized. Malignant tumors are collectively termed cancers. The term 
"malignant" generally means that the tumor can invade and destroy neighboring 
body structures and spread to distant sites to cause death (for review, see Robbins 
and Angell, 1976, Basic Pathology, 2d Ed., W.B. Saunders Co., Philadelphia, 
15 pp. 68-122). 

Effective treatment and prevention of cancer remains a long-felt 
need, and a major goal of biomedical research. 

3. SUMMARY OF THE INVENTION 
20 The present invention relates to therapeutic and diagnostic methods 

and compositions based on Notch proteins and nucleic acids. The invention 
provides for treatment of disorders of cell fate or differentiation by administration 
of a therapeutic compound of the invention. Such therapeutic compounds (termed 
herein "Therapeutics") include: Notch proteins and analogs and derivatives 
25 (including fragments) thereof; antibodies thereto; nucleic acids encoding the 

Notch proteins, analogs, or derivatives; Notch antisense nucleic acids; as well as 
toporythmic proteins and derivatives which bind to or otherwise interact with 
Notch proteins, and their encoding nucleic acids and antibodies. In a preferred 
embodiment, a Therapeutic of the invention is administered to treat a cancerous 
30 condition, or to prevent progression from a pre-neoplastic or non-malignant state 
into a neoplastic or a malignant state. In other specific embodiments, a 
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Therapeutic of the invention is administered to treat a nervous system disorder or 
to promote tissue regeneration and repair. 

In one embodiment. Therapeutics which antagonize, or inhibit, 
Notch function (hereinafter "Antagonist Therapeutics 11 ) are administered for 

5 therapeutic effect; disorders which can be thus treated can be identified by in vitro 
assays such as described in Section 5.1, infra. Such Antagonist Therapeutics 
include but are not limited to Notch antisense nucleic acids, anti-Notch 
neutralizing antibodies, and competitive inhibitors of Notch protein-protein 
interactions (e.g., a protein comprising Notch ELR-11 and ELR-12 and 

10 derivatives thereof), all as detailed infra. 

In another embodiment, Therapeutics which promote Notch 
function (hereinafter "Agonist Therapeutics") are administered for therapeutic 
effect; disorders which can thus be treated can be identified by in vitro assays 
such as described in Section 5.1, infra. Such Agonist Therapeutics include but 

15 are not limited to Notch proteins and derivatives thereof comprising the 
intracellular domain, and proteins that interact with Notch (e.g., a protein 
comprising a Delta sequence homologous to Drosophila Delta amino acids 1-230 
(see Figure 1 and SEQ ID NO:2), or comprising a Serrate sequence homologous 
to Drosophila Serrate amino acids 79-282 (see Figure 5 and SEQ ID NO:4)). 

20 Disorders of cell fate, in particular hyperproliferative (e.g., 

cancer) or hypoproliferative disorders, involving aberrant or undesirable levels of 
expression or activity of Notch protein can be diagnosed by detecting such levels, 
as described more fully infra. 

In a preferred aspect, a Therapeutic of the invention is a protein 

25 consisting of at least a fragment (termed herein "adhesive fragment") of the 

proteins encoded by toporythmic genes which mediates binding to Notch proteins 
or adhesive fragments thereof. Toporythmic genes, as used herein, shall mean 
the genes Notch , Delta , and Serrate , as well as other members of the 
Deita / Serrate family which may be identified by virtue of sequence homology or 

^0 genetic interaction, and in general, members of the "Notch cascade" or the 
"Notch group" of genes, which are identified by molecular interactions (e.g., 
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binding in vitro) or genetic interactions (as detected phenotypically, e.g., in 
Drosophila). 

In another aspect, the invention is directed to human Notch 
proteins; in particular, that encoded by the hN homolog, and proteins comprising 
5 the extracellular domain of the protein and subsequences thereof. Nucleic acids 
encoding the foregoing, and recombinant cells are also provided. 

3.1. DEFINITIONS 
As used herein, the following terms shall have the meanings 

10 indicated: 

AA = amino acid 
EGF = epidermal growth factor 
ELR = EGF-iike (homologous) repeat 
IC = intracellular 
15 PCR = polymerase chain reaction 

As used herein, underscoring the name of a gene shall indicate the 
gene, in contrast to its encoded protein product which is indicated by the name of 
the gene in the absence of any underscoring. For example, " Notch " shall mean 
the Notch gene, whereas "Notch" shall indicate the protein product of the Notch 
20 gene. 

4. DESCRIPTION OF THE FIGURES 
Figure 1. Primary Nucleotide Sequence of the Delta cDNA Dll 
(SEQ ID NO:l) and Delta amino acid sequence (SEQ ID NO:2). The DNA 
25 sequence of the 5'-3' strand of the Dll cDNA is shown, which contains a number 
of corrections in comparison to that presented in Kopczynkski et al. (1988, Genes 
Dev. 2:1723-1735). 

Figure 2. Notch Expression Constructs and the Deletion Mapping 
of the Delta/Serrate Binding Domain. S2 cells in log phase growth were 
30 transiently transfected with the series of expression constructs shown; the 

drawings represent the predicted protein products of the various Notch deletion 
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mutants created. All expression constructs were derived from construct #1 
pMtNMg. Transiently transfected cells were mixed with Delta expressing cells 
from the stably transformed line L49-6-7 or with transiently transfected Serrate 
expressing cells, induced with CuS0 4 , incubated under aggregation conditions and 

5 then scored for their ability to aggregate using specific antisera and 

immunofluorescence microscopy. Aggregates were defined as clusters of four or 
more cells containing both Notch and Delta/Serrate expressing cells. The values 
given for % Aggregation refer to the percentage of all Notch expressing cells 
found in such clusters either with Delta (Dl) (left column) or with Serrate (Ser) 

10 (right column). The various Notch deletion constructs are represented 

diagrammatical ly with splice lines indicating the ligation junctions. Each EGF 
repeat is denoted as a stippled rectangular box and numbers of the EGF repeats 
on either side of a ligation junction are noted. At the ligation junctions, partial 
EGF repeats produced by the various deletions are denoted by open boxes and 

15 closed brackets (for example see #23 ACla+EGF(10-12)). Constructs #3-13 
represent the Clal deletion series. As diagrammed, four of the Clal sites, in 
repeats 7, 9, 17 and 26, break the repeat in the middle, immediately after the 
third cysteine (denoted by open box repeats; see Figure 3 for further 
clarification), while the fifth and most 3' site breaks neatly between EGF repeats 

20 30 and 31 (denoted by closed box repeat 31; again see Figure 3). In construct 
#15 split, EGF repeat 14 which carries the suiit point mutation, is drawn as a 
striped box. In construct #33 ACla+XEGF(10-13), the Xenopus Notch derived 
EGF repeats are distinguished from Drosophila repeats by a different pattern of 
shading. SP, signal peptide; EGF, epidermal growth factor repeat; N, Notch / lin - 

25 12 repeat; TM, transmembrane domain; cdclO, cdclO/ankyrin repeats; PA, 
putative nucleotide binding consensus sequence; opa, polyglutamine stretch 
termed opa; Dl, Delta; Ser, Serrate. 

Figure 3. Detailed Structure of Notch Deletion Constructs #19-24: 
Both EGF Repeats 11 and 12 are Required for Notch-Delta Aggregation. EGF 

30 repeats 10-13 are diagrammed at the top showing the regular spacing of the six 
cysteine residues (C). PCR products generated for these constructs (names and 
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numbers as given in Figure 2) are represented by the heavy black lines and the 
exact endpoints are noted relative to the various EGF repeats. Ability to 
aggregate with Delta is recorded as (+) or (-) for each construct. The PCR 
fragments either break the EGF repeats in the middle, just after the third cysteine 

5 in the same place as four out of the five Clal sites, or exactly in between two 
repeats in the same place as the most C-terminal Clal site. 

Figure 4. Comparison of Amino Acid Sequence of EGF Repeats 
11 and 12 from Drosophila and Xenopus Notch. The amino acid sequence of 
EGF repeats 11 and 12 of Drosophila Notch (SEQ ID NO: 14) (Wharton et al., 

10 1985, Cell 43:567-581; Kidd et al., 1986, Mol. Cell Biol. 6:3094-3108) is 
aligned with that of the same two EGF repeats from Xenopus Notch (SEQ ID 
NO:15) (Coffman et al., 1990, Science 249:1438-1441). Identical amino acids 
are boxed. The six conserved cysteine residues of each EGF repeat and the Ca + + 
binding consensus residues (Rees et al., 1988, EMBO J. 7:2053-2061) are 

15 marked with an asterisk (*). The leucine to proline change found in the Xenopus 
PCR clone that failed to aggregate is noted underneath. 

Figure 5. Nucleic Acid Sequence Homologies Between Serrate 
and Delta . A portion of the Drosophila Serrate nucleotide sequence (SEQ ID 
NO:3), with the encoded Serrate protein sequence (SEQ ID NO:4) written below 

20 (Fleming et al., 1990, Genes & Dev. 4:2188-2201 at 2193-94) is shown. The 
four regions showing high sequence homology with the Drosophila Delta 
sequence are numbered above the line and indicated by brackets. The total region 
of homology spans nucleotide numbers 627 through 1290 of the Serrate nucleotide 
sequence (numbering as in Figure 4 of Fleming et al., 1990, Genes & Dev. 

25 4:2188-2201). 

Figure 6. Schematic Diagram of Human Notch Clones. A 
schematic diagram of human Notch is shown. Heavy bold-face lines below the 
diagram show that portion of the Notch sequence contained in each of the four 
cDNA clones. The location of the primers used in PCR, and their orientation, 
30 are indicated by arrows. 
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Figure 7. Human Notch Sequences Aligned with Drosophila 
Notch Sequence. Numbered vertical lines correspond to Drosophila Notch 
coordinates. Horizontal lines below each map show where clones lie relative to 
stretches of sequence (thick horizontal lines). 
5 Figure 8. Nucleotide Sequences of Human Notch Contained in 

Plasmid cDNA Clone hN2k. Figure 8A: The DNA sequence (SEQ ID NO:5) of 
a portion of the human Notch insert is shown, starting at the EcoRI site at the 3' 
end, and proceeding in the 3' to 5' direction. Figure 8B: The DNA sequence 
(SEQ ID NO:6) of a portion of the human Notch insert is shown, starting at the 

!0 EcoRI site at the 5' end, and proceeding in the 5' to 3' direction. Figure 8C: 
The DNA sequence (SEQ ID NO:7) of a portion of the human Notch insert is 
shown, starting 3' of the sequence shown in Figure 8B, and proceeding in the 5' 
to 3' direction. The sequences shown are tentative, subject to confirmation by 
determination of overlapping sequences. 

15 Figure 9. Nucleotide Sequences of Human Notch Contained in 

Plasmid cDNA clone hN4k. Figure 9A: The DNA sequence (SEQ ID NO:8) of 
a portion of the human Notch insert is shown, starting at the EcoRI site at the 5' 
end, and proceeding in the 5' to 3' direction. Figure 9B: The DNA sequence 
(SEQ ID NO:9) of a portion of the human Notch insert is shown, starting near 

20 the 3' end, and proceeding in the 3' to 5' direction. The sequences shown are 
tentative, subject to confirmation by determination of overlapping sequences. 
Figure 10. DNA (SEQ ID NO: 10) and Amino Acid (SEQ ID 
NO: 11) Sequences of Human Notch Contained in Plasmid cDNA Clone hN3k. 

Figure 11. DNA (SEQ ID NO: 12) and Amino Acid (SEQ ID 

25 NO: 13) Sequences of Human Notch Contained in Plasmid cDNA Clone hN5k. 

Figure 12. Comparison of hN5k With Other Notch Homologs. 
Figure 12A. Schematic representation of Drosophila Notch. Indicated are the 
signal sequence (signal), the 36 EGF-like repeats, the three Notch /iin-12 repeats, 
the transmembrane domain (TM), the six CDC10 repeats, the OPA repeat, and 

30 the PEST (proline, glutamic acid, serine, threonine)-rich region. Figure 12B. 
Alignment of the deduced amino acid sequence of hN5k with sequences of other 



35 



WO 94/07474 



- 10 - 



4 

PCT/US93/09338 



Notch homologs. Amino acids are numbered on the left side. The cdclO and 
PEST-rich regions are both boxed, and individual cdclO repeats are marked. 
Amino acids which are identical in three or more sequences are highlighted. The 
primers used to clone hN5k are indicated below the sequences from which they 

5 were designed. The nuclear localization sequence (NLS), casein kinase II (CKH), 
and cdc2 kinase (cdc2) sites of the putative CcN motif of the vertebrate Notch 
homologs are boxed. The possible bipartite nuclear targeting sequence (BNTS) 
and proximal phosphorylation sites of Drosophila Notch are also boxed. 

Figure 13. Aligned amino acid sequences of Notch proteins of 

10 various species. humN: the human Notch protein encoded by the hN homolog 
(contained in part in plasmid hN5k) (SEQ ID NO: 19). TAN-1: the human Notch 
protein encoded by the TAN-1 homolog (SEQ ID NO:20) (the sequence shown is 
derived partly from our own work and partly from the TAN-1 sequence as published 
by Ellisen et al, 1991, Cell 66:649-661); Xen N: Xenopus Notch protein (Coffman 

IS et al., 1990, Science 249:1438-1441). Dros N: Drosophila Notch protein 
(Wharton et al., 1985, Cell 43:567-581). Structural domains are indicated. 

Figure 14. Immunocytochemical staining of breast cancer tissue 
from a human patient. Malignant breast tissue in a sample obtained from a 
human patient was embedded in a paraffin section, and subjected to 

20 immunocytochemical staining with anti-human Notch monoclonal antibody P4, 
directed against the TAN-1 protein. Non-malignant breast tissue exhibited much 
less staining (not shown). 

Figure 15. Immunocytochemical staining of colon tissue from a 
human patient with colon cancer. A colon tissue sample obtained from a patient 

25 with colon cancer was embedded in a paraffin section, and subjected to 

immunocytochemical staining with anti-human Notch monoclonal antibody PI, 
directed against the hN-encoded protein. Areas of increased staining are those 
areas in which malignant cells are present, as determined by cell morphology. 

Figure 16. Immunocytochemical staining of cervical tissue. 

30 Human tissue samples were obtained, containing cancer of the cervix (Fig. 16A) 
or normal cervical epithelium (Fig. 16B) from the same patient, embedded in a 
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paraffin section, and subjected to immunocytochemical staining with anti-human 
Notch monoclonal antibody directed against the TAN-1 protein. Areas containing 
malignant cells (as determined by morphology) exhibited increasing staining 
relative to non-malignant cells. Among non-malignant cells, connective tissue and 

5 the basal layer of the epithelium (containing stem cells) stained with the anti- 
Notch antibody. 

Figure 17. DNA (SEQ ID NO:21) and encoded amino acid 
sequence (contained in SEQ ID NO: 19) of human Notch homolog hN. The entire 
DNA coding sequence is presented (as well as noncoding sequence), with the 

10 exclusion of that encoding the initiator Met. The last 8 nucleotides shown 
(numbers 9716-9723) are vector, and not hN, sequences. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to therapeutic and diagnostic methods 

15 and compositions based on Notch proteins and nucleic acids. The invention 

provides for treatment of disorders of cell fate or differentiation by administration 
of a therapeutic compound of the invention. Such therapeutic compounds (termed 
herein "Therapeutics") include: Notch proteins and analogs and derivatives 
(including fragments) thereof; antibodies thereto; nucleic acids encoding the 

20 Notch proteins, analogs, or derivatives; Notch antisense nucleic acids; as well as 
toporythmic proteins and derivatives and analogs thereof which bind to or 
otherwise interact with Notch proteins, and their encoding nucleic acids and 
antibodies. Also included are proteins and derivatives and analogs thereof which 
are capable of inhibiting the interactions of a Notch protein with another 

25 toporythmic protein {e.g. Delta, Serrate). In a preferred embodiment, a 

Therapeutic of the invention is administered to treat a cancerous condition, or to 
prevent progression from a pre-neoplastic or non-malignant state (eg., 
metaplastic condition) into a neoplastic or a malignant state. In another specific 
embodiment, a Therapeutic of the invention is administered to treat a nervous 

30 system disorder, such as nerve injury or a degenerative disease. In yet another 
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specific embodiment, a Therapeutic of the invention is administered to promote 
tissue regeneration and repair for treatment of various conditions. 

In one embodiment, Therapeutics which antagonize, or inhibit, 
Notch function (hereinafter "Antagonist Therapeutics") are administered for 
5 therapeutic effect; disorders which can be thus treated can be identified by in vitro 
assays such as described in Section 5.1, infra. Such Antagonist Therapeutics 
include but are not limited to Notch antisense nucleic acids, anti-Notch 
neutralizing antibodies, competitive inhibitors of Notch protein-protein 
interactions {e.g., a protein comprising Notch ELR-11 and ELR-12), and 
10 molecules which interfere with notch intracellular function such as that mediated 
by the cdclO repeats, as detailed infra. 

In another embodiment, Therapeutics which promote Notch 
function (hereinafter "Agonist Therapeutics") are administered for therapeutic 
effect; disorders which can thus be treated can be identified by in vitro assays 
15 such as described in Section 5.1, infra. Such Agonist Therapeutics include but 
are not limited to Notch proteins and derivatives thereof comprising the 
intracellular domain, Notch nucleic acids encoding the foregoing, and proteins 
comprising toporythmic protein domains that interact with Notch (e.g., a protein 
comprising an extracellular domain of a Delta protein or a Delta sequence 
20 homologous to Drosophila Delta amino acids 1-230 (see Figure 1 and SEQ ID 
NO:2), or comprising a Serrate sequence homologous to Drosophila Serrate 
amino acids 79-282 (see Figure 5 and SEQ ID NO:4)). 

Disorders of cell fate, in particular precancerous conditions such as 
metaplasia and dysplasia, and hyperproliferative (e.g. , cancer) or 
25 hypoproliferative disorders, involving aberrant or undesirable levels of expression 
or activity of Notch protein can be diagnosed by detecting such levels, as 
described more fully infra. 

In a preferred aspect, a Therapeutic of the invention is a protein 
consisting of at least a fragment (termed herein "adhesive fragment") of the 
30 proteins encoded by toporythmic genes which mediates binding to Notch proteins 
or adhesive fragments thereof. Toporythmic genes, as used herein, shall mean 
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the genes Notch . Delta , and Serrate , as well as other members of the 
DsUa/Seriate family which may be identified by virtue of sequence homology or 
genetic interaction, and, more generally, members of the "Notch cascade" or the 
"Notch group" of genes, which are identified by molecular interactions (e.g., 
5 binding in vitro) or genetic interactions (as detected phenotypically, e.g. , in 
Drosophila). 

The invention further provides a human Notch protein encoded by 
the hN homolog, and proteins comprising the extracellular domain of the Notch 
protein and subsequences thereof. Nucleic acids encoding the foregoing, and 
10 recombinant cells are also provided. 

For clarity of disclosure, and not by way of limitation, the detailed 
description of the invention is divided into the following subsections: 





(0 


Therapeutic Uses; 




(ii) 


Prophylactic Uses; 


15 


(iii) 


Demonstration of Therapeutic or Prophylactic Utility; 




(iv) 


Therapeutic/Prophylactic Administration and Compositions; 




(v) 


Antisense Regulation of Notch Expression; 




(vi) 


Diagnostic Utility; 




(vii) 


Notch Nucleic Acids; 


20 


(viii) 


Recombinant Production of Protein Therapeutics; 




(ix) 


Derivatives and Analogs of Notch and Other Toporythmic 
Proteins; 




(x) 


Assays of Notch Proteins, Derivatives and Analogs; and 




(xi) 


Antibodies to Notch Proteins, Derivatives and Analogs. 


25 




S 1 THFRAPEUT1C USES 



As stated supra, the Antagonist Therapeutics of the invention are 
those Therapeutics which antagonize, or inhibit, a Notch function. Such 
Antagonist Therapeutics are most preferably identified by use of known 
30 convenient in vitro assays, e.g. , based on their ability to inhibit binding of Notch 
to other proteins (see Sections 6-8 herein), or inhibit any known Notch function 
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as assayed in vitro, although genetic assays (e.g., in Drosophila) may also be 
employed. In a preferred embodiment, the Antagonist Therapeutic is a protein or 
derivative thereof comprising a functionally active fragment such as an adhesive 
fragment of Notch. In specific embodiments, such an Antagonist Therapeutic 

5 may be those adhesive proteins encoded by the appropriate constructs described in 
Sections 6 and 7 infra, or proteins comprising the Notch extracellular region, in 
particular ELR-11 and ELR-12, or an antibody thereto, or an analog/competitive 
inhibitor of a Notch intracellular signal-transducing region, a nucleic acid capable 
of expressing a Notch adhesive fragment, or a Notch antisense nucleic acid (see 

10 Section 5.5 herein). It should be noted that in certain instances, a Notch adhesive 
fragment (or possibly other presumed Antagonist Therapeutics) may alternatively 
act as an Agonist Therapeutic, depending on the developmental history of the 
tissue being exposed to the Therapeutic; preferably, suitable in vitro or in vivo 
assays, as described infra, should be utilized to determine the effect of a specific 

15 Therapeutic and whether its administration is indicated for treatment of the 
affected tissue. 

In another embodiment of the invention, a nucleic acid containing 
a portion of a Notch gene is used, as an Antagonist Therapeutic, to promote 
Notch inactivation by homologous recombination (Koller and Smithies, 1989, 
20 Proc. Natl. Acad. Sci. USA 86:8932-8935; Zijlstra et al. f 1989, Nature 342:435- 
438). 

The Agonist Therapeutics of the invention, as described supra, 
promote Notch function. Such Agonist Therapeutics include but are not limited 
to proteins and derivatives comprising the portions of toporythmic proteins such 

25 as Delta or Serrate that mediate binding to Notch, and nucleic acids encoding the 
foregoing (which can be administered to express their encoded products in vivo). 
In a specific embodiment, such a portion of Delta is D. melanogaster Delta amino 
acids 1-230 (SEQ ID NO:l) or a portion of a human Delta most homologous 
thereto. In another specific embodiment, such a portion of Serrate is D. 

30 melanogaster Serrate amino acids 79-282 (SEQ ID NO:5), or a portion of a 
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human Serrate most homologous thereto. In other specific embodiments, such a 
portion of Delta or Serrate is the extracellular portion of such protein. 

Further descriptions and sources of Therapeutics of the inventions 
are found in Sections 5.4 through 5.8 herein. 

5 The Agonist and Antagonist Therapeutics of the invention have 

therapeutic utility for disorders of cell fate. The Agonist Therapeutics are 
administered therapeutically (including prophylactically): (1) in diseases or 
disorders involving an absence or decreased (relative to normal, or desired) levels 
of Notch function, for example, in patients where Notch protein is lacking, 

10 genetically defective, biologically inactive or underactive, or underexpressed; and 
(2) in diseases or disorders wherein in vitro (or in vivo) assays (see infra) indicate 
the utility of Notch agonist administration. The absence or decreased levels in 
Notch function can be readily detected, e.g., by obtaining a patient tissue sample 
(e.g. , from biopsy tissue) and assaying it in vitro for protein levels, structure 

15 and/or activity of the expressed Notch protein. Many methods standard in the art 
can be thus employed, including but not limited to immunoassays to detect and/or 
visualize Notch protein (e.g., Western blot, immunoprecipitation followed by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis, immunocytochemistry, 
etc.; see also those assays listed in Section 5.6, infra), and/or hybridization assays 

20 to detect Notch expression by detecting and/or visualizing Notch mRNA (e.g., 
Northern assays, dot blots, in situ hybridization, etc.) 

In vitro assays which can be used to determine whether 
administration of a specific Agonist Therapeutic or Antagonist Therapeutic is 
indicated, include in vitro cell culture assays in which a patient tissue sample is 

25 grown in culture, and exposed to or otherwise administered a Therapeutic, and 
the effect of such Therapeutic upon the tissue sample is observed. In one 
embodiment, where the patient has a malignancy, a sample of cells from such 
malignancy is plated out or grown in culture, and the cells are then exposed to a 
Therapeutic. A Therapeutic which inhibits survival or growth of the malignant 

^0 cells (e.g., by promoting terminal differentiation) is selected for therapeutic use in 
vivo. Many assays standard in the art can be used to assess such survival and/or 
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growth; for example, cell proliferation can be assayed by measuring 3 H-thymidine 
incorporation, by direct cell count, by detecting changes in transcriptional activity 
of known genes such as proto-oncogenes (e.g. Jos, myc) or cell cycle markers; 
cell viability can be assessed by trypan blue staining, differentiation can be 

5 assessed visually based on changes in morphology, etc. In a specific aspect, the 
malignant cell cultures are separately exposed to (1) an Agonist Therapeutic, and 
(2) an Antagonist Therapeutic; the result of the assay can indicate which type of 
Therapeutic has therapeutic efficacy. 

In another embodiment, a Therapeutic is indicated for use which 

10 exhibits the desired effect, inhibition or promotion of cell growth, upon a patient 
cell sample from tissue having or suspected of having a hyper- or 
hypoproliferative disorder, respectively. Such hyper- or hypoproliferative 
disorders include but are not limited to those described in Sections 5.1.1 through 
5.1.3 infra. 

15 In another specific embodiment, a Therapeutic is indicated for use 

in treating nerve injury or a nervous system degenerative disorder (see Section 
5.1.2) which exhibits in vitro promotion of nerve regeneration/neurite extension 
from nerve cells of the affected patient type. 

In addition, administration of an Antagonist Therapeutic of the 

20 invention is also indicated in diseases or disorders determined or known to 
involve a Notch dominant activated phenotype ("gain of function" mutations.) 
Administration of an Agonist Therapeutic is indicated in diseases or disorders 
determined or known to involve a Notch dominant negative phenotype ("loss of 
function" mutations). We have investigated the functions of various structural 

25 domains of the Notch protein in vivo, by ectopically expressing a series of 

Drosophila Notch deletion mutants under the hsp70 heat-shock promoter, as well 
as eye-specific promoters. Two classes of dominant phenotypes were observed, 
one suggestive of Notch loss-of function mutations and the other of Notch gain-of- 
fiinction mutations. Dominant "activated" phenotypes resulted from 

30 overexpression of a protein lacking most extracellular sequences, while dominant 
"negative" phenotypes resulted from overexpression of a protein lacking most 
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intracellular sequences. Our results indicate that Notch functions as a receptor 
whose extracellular domain mediates ligand-binding, resulting in the transmission 
of developmental signals by the cytoplasmic domain. The phenotypes observed 
also suggested that the cdclO/ankyrin repeat region within the intracellular domain 
5 plays an essential role in Notch mediated signal transduction events (intracellular 
function). 

In various specific embodiments, in vitro assays can be carried out 
with representative cells of cell types involved in a patient's disorder, to 
determine if a Therapeutic has a desired effect upon such cell types. 

10 In another embodiment, cells of a patient tissue sample suspected 

of being pre-neoplastic are similarly plated out or grown in vitro, and exposed to 
a Therapeutic. The Therapeutic which results in a cell phenotype that is more 
normal (i.e., less representative of a pre-neoplastic state, neoplastic state, 
malignant state, or transformed phenotype) is selected for therapeutic use. Many 

15 assays standard in the art can be used to assess whether a pre-neoplastic state, 
neoplastic state, or a transformed or malignant phenotype, is present (see Section 
5.2.1). For example, characteristics associated with a transformed phenotype (a 
set of in vitro characteristics associated with a tumorigenic ability in vivo) include 
a more rounded cell morphology, looser substratum attachment, loss of contact 

20 inhibition, loss of anchorage dependence, release of proteases such as 

plasminogen activator, increased sugar transport, decreased serum requirement, 
expression of fetal antigens, disappearance of the 250,000 dalton surface protein, 
etc. (see Luria et al., 1978, General Virology, 3d Ed., John Wiley & Sons, New 
York pp. 436-446). 

25 in other specific embodiments, the in vitro assays described supra 

can be carried out using a cell line, rather than a cell sample derived from the 
specific patient to be treated, in which the cell line is derived from or displays 
characteristic(s) associated with the malignant, neoplastic or pre-neoplastic 
disorder desired to be treated or prevented, or is derived from the neural or other 

30 cell type upon which an effect is desired, according to the present invention. 



35 



PCT/US93/09338 
WO 94/07474 

- 18 - 



10 



The Antagonist Therapeutics are administered therapeutically 
(including prophylactically): (1) in diseases or disorders involving increased 
(relative to normal, or desired) levels of Notch function, for example, where the 
Notch protein is overexpressed or overactive; and (2) in diseases or disorders 
wherein in vitro (or in vivo) assays indicate the utility of Notch antagonist 
administration. The increased levels of Notch function can be readily detected by 
methods such as those described above, by quantifying protein and/or RNA. In 
vitro assays with cells of patient tissue sample or the appropriate cell line or cell 
type, to determine therapeutic utility, can be carried out as described above. 



5.1.1. MALIGNANCIES 
Malignant and pre-neoplastic conditions which can be tested as 
described supra for efficacy of intervention with Antagonist or Agonist 
Therapeutics, and which can be treated upon thus observing an indication of 
15 therapeutic utility, include but are not limited to those described below in Sections 

5.1.1 and 5.2.1. 

Malignancies and related disorders, cells of which type can be 
tested in vitro (and/or in vivo), and upon observing the appropriate assay result, 
treated according to the present invention, include but are not limited to those 
20 listed in Table 1 (for a review of such disorders, see Fishman et al., 1985, 
Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia): 



TABLE 1 

25 MALIGNANCIES AND RELA TED DISORDERS 

Leukemia 

acute leukemia 

acute lymphocytic leukemia 
acute myelocytic leukemia 
myeloblasts 
promyelocytic 
myelomonocytic 
monocytic 
erythroleukemia 
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chronic leukemia 

chronic myelocytic (granulocytic) leukemia 
chronic lymphocytic leukemia 
Polycythemia vera 
Lymphoma 

Hodgkin's disease 
5 non-Hodgkin's disease 

Multiple myeloma 
Waldenstrom's macroglobulinemia 
Heavy chain disease 
Solid tumors 

sarcomas and carcinomas 
fibrosarcoma 

jq myxosarcoma 

liposarcoma 

chondrosarcoma 

osteogenic sarcoma 

chordoma 

angiosarcoma 

endotheliosarcoma 

lymphangiosarcoma 
15 lymphangioendotheliosarcoma 

synovioma 

mesothelioma 

Ewing's tumor 

leiomyosarcoma 

rhabdomyosarcoma 

colon carcinoma 
2Q pancreatic cancer 

breast cancer 

ovarian cancer 

prostate cancer 

squamous cell carcinoma 

basal cell carcinoma 

adenocarcinoma 

sweat gland carcinoma 
25 sebaceous gland carcinoma 

papillary carcinoma 

papillary adenocarcinomas 

cystadenocarcinoma 

medullary carcinoma 

bronchogenic carcinoma 

renal cell carcinoma 
2q hepatoma 

bile duct carcinoma 

choriocarcinoma 

seminoma 
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embryonal carcinoma 

Wilms* tumor 

cervical cancer 

testicular tumor 

lung carcinoma 

small cell lung carcinoma 

bladder carcinoma 

epithelial carcinoma 

glioma 

astrocytoma 

medulloblastoma 

craniopharyngioma 

ependymoma 

pineaioma 

hemangioblastoma 

acoustic neuroma 

oligodendroglioma 

menangioma 

melanoma 

neuroblastoma 

retinoblastoma 



15 



In specific embodiments, malignancy or dysproliferative changes 
(such as metaplasias and dysplasias) are treated or prevented in epithelial tissues 
such as those in the cervix, esophagus, and lung. 

20 

As detailed in the examples section 10.1 infra, malignancies of the 
breast, colon, and cervix exhibit increased expression of human Notch relative to 
such non-malignant tissue. Thus, in specific embodiments, malignancies of the 
breast, colon, or cervix are treated or prevented by administering an effective 
amount of an Antagonist Therapeutic of the invention. The presence of increased 

25 

Notch expression in breast, colon, and cervical cancer suggests that many more 
cancerous conditions exhibit upregulated Notch. Thus, we envision that many 
more cancers, e.g., seminoma, melanoma, and lung cancer, can be treated or 
prevented by administration of an Antagonist Therapeutic. 

30 
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5.1.2. NERVOUS SYSTEM DISORDERS 
Nervous system disorders, involving cell types which can be tested 
as described supra for efficacy of intervention with Antagonist or Agonist 
Therapeutics, and which can be treated upon thus observing an indication of 
5 therapeutic utility, include but are not limited to nervous system injuries, and 

diseases or disorders which result in either a disconnection of axons, a diminution 
or degeneration of neurons, or demyelination. Nervous system lesions which may 
be treated in a patient (including human and non-human mammalian patients) 
according to the invention include but are not limited to the following lesions of 
10 either the central (including spinal cord, brain) or peripheral nervous systems: 
(i) traumatic lesions, including lesions caused by physical 
injury or associated with surgery, for example, lesions 
which sever a portion of the nervous system, or 
compression injuries; 

IS (ii) ischemic lesions, in which a lack of oxygen in a portion of 

the nervous system results in neuronal injury or death, 
including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) malignant lesions, in which a portion of the nervous system 
20 is destroyed or injured by malignant tissue which is either a 

nervous system associated malignancy or a malignancy 
derived from non-nervous system tissue; 

(iv) infectious lesions, in which a portion of the nervous system 
is destroyed or injured as a result of infection, for 

25 example, by an abscess or associated with infection by 

human immunodeficiency virus, herpes zoster, or herpes 
simplex virus or with Lyme disease, tuberculosis, syphilis; 

(v) degenerative lesions, in which a portion of the nervous 
system is destroyed or injured as a result of a degenerative 

30 process including but not limited to degeneration associated 
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with Parkinson's disease, Alzheimer's disease, 
Huntington's chorea, or amyotrophic lateral sclerosis; 
(vi) lesions associated with nutritional diseases or disorders, in 
which a portion of the nervous system is destroyed or 
5 injured by a nutritional disorder or disorder of metabolism 

including but not limited to, vitamin B12 deficiency, folic 
acid deficiency, Wernicke disease, tobacco-alcohol 
amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic 
10 cerebellar degeneration; 

(vii) neurological lesions associated with systemic diseases 

including but not limited to diabetes (diabetic neuropathy, 
Bell's palsy), systemic lupus erythematosus, carcinoma, or 
sarcoidosis; 

15 (viii) lesions caused by toxic substances including alcohol, lead, 

or particular neurotoxins; and 
(ix) demyelinated lesions in which a portion of the nervous 

system is destroyed or injured by a demyelinating disease 
including but not limited to multiple sclerosis, human 
20 immunodeficiency virus-associated myelopathy, transverse 

myelopathy or various etiologies, progressive multifocal 
leukoencephalopathy, and central pontine myelolysis. 
Therapeutics which are useful according to the invention for 
treatment of a nervous system disorder may be selected by testing for biological 
25 activity in promoting the survival or differentiation of neurons (see also Section 
5.1). For example, and not by way of limitation. Therapeutics which elicit any 
of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

30 
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(iii) increased production of a neuron-associated molecule in 
culture or in vivo, e.g., choline acetyltransf erase or 
acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

5 Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the 
method set forth in Arakawa et al. (1990, J. NeuroscL 10:3507-3515); increased 
sprouting of neurons may be detected by methods set forth in Pestronk et al. 
(1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. Rev. NeuroscL 
10 4:17-42); increased production of neuron-associated molecules may be measured 
by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 
depending on the molecule to be measured; and motor neuron dysfunction may be 
measured by assessing the physical manifestation of motor neuron disorder, e.g., 
weakness, motor neuron conduction velocity, or functional disability. 
15 In a specific embodiments, motor neuron disorders that may be 

treated according to the invention include but are not limited to disorders such as 
infarction, infection, exposure to toxin, trauma, surgical damage, degenerative 
disease or malignancy that may affect motor neurons as well as other components 
of the nervous system, as well as disorders that selectively affect neurons such as 
20 amyotrophic lateral sclerosis, and including but not limited to progressive spinal 
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and 
juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio- 
Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary 
Motorsensory Neuropathy (Charcot-Marie-Tooth Disease). 

25 

5.1.3. TISSUE REPAIR AND REGENERATION 
In another embodiment of the invention, a Therapeutic of the 
invention is used for promotion of tissue regeneration and repair, including but 
not limited to treatment of benign dysproliferative disorders. Specific 
30 embodiments are directed to treatment of cirrhosis of the liver (a condition in 
which scarring has overtaken normal liver regeneration processes), treatment of 
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keloid (hypertrophic scar) formation (disfiguring of the skin in which the scarring 
process interferes with normal renewal), psoriasis (a common skin condition 
characterized by excessive proliferation of the skin and delay in proper cell fate 
determination), and baldness (a condition in which terminally differentiated hair 
5 follicles (a tissue rich in Notch) fail to function properly). 



10 progression to a neoplastic or malignant state, including but not limited to those 
disorders listed in Table 1. Such administration is indicated where the 
Therapeutic is shown in assays, as described supra, to have utility for treatment 
or prevention of such disorder. Such prophylactic use is indicated in conditions 
known or suspected of preceding progression to neoplasia or cancer, in particular, 

15 where non-neoplastic cell growth consisting of hyperplasia, metaplasia, or most 
particularly, dysplasia has occurred (for review of such abnormal growth 
conditions, see Robbins and Angell, 1976, Basic Pathology, 2d Ed., W.B. 
Saunders Co., Philadelphia, pp. 68-79.) Hyperplasia is a form of controlled cell 
proliferation involving an increase in cell number in a tissue or organ, without 

20 significant alteration in structure or function. As but one example, endometrial 
hyperplasia often precedes endometrial cancer. Metaplasia is a form of controlled 
cell growth in which one type of adult or fully differentiated cell substitutes for 
another type of adult cell. Metaplasia can occur in epithelial or connective tissue 
cells. Atypical metaplasia involves a somewhat disorderly metaplastic epithelium. 

25 Dysplasia is frequently a forerunner of cancer, and is found mainly in the 

epithelia; it is the most disorderly form of non-neoplastic cell growth, involving a 
loss in individual cell uniformity and in the architectural orientation of cells. 
Dysplastic cells often have abnormally large, deeply stained nuclei, and exhibit 
pleomorphism. Dysplasia characteristically occurs where there exists chronic 

30 irritation or inflammation, and is often found in the cervix, respiratory passages, 
oral cavity, and gall bladder. 



5.2. PROPHYLACTIC USES 
5.2.1. MALIGNANCIES 
The Therapeutics of the invention can be administered to prevent 
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Alternatively or in addition to the presence of abnormal cell 
growth characterized as hyperplasia, metaplasia, or dysplasia, the presence of one 
or more characteristics of a transformed phenotype, or of a malignant phenotype, 
displayed in wvo or displayed in viiro by a cell sample from a patient, can 
5 indicate the desirability of prophylactic/therapeutic administration of a Therapeutic 
of the invention. As mentioned supra, such characteristics of a transformed 
phenotype include morphology changes, looser substratum attachment, loss of 
contact inhibition, loss of anchorage dependence, protease release, increased 
sugar transport, decreased serum requirement, expression of fetal antigens, 
10 disappearance of the 250,000 dalton cell surface protein, etc. (see also id., at pp. 
84-90 for characteristics associated with a transformed or malignant phenotype). 

In a specific embodiment, leukoplakia, a benign-appearing 
hyperplastic or dysplastic lesion of the epithelium, or Bowen's disease, a 
carcinoma in situ, are pre-neoplastic lesions indicative of the desirability of 
15 prophylactic intervention. 

In another embodiment, fibrocystic disease (cystic hyperplasia, 
mammary dysplasia, particularly adenosis (benign epithelial hyperplasia)) is 
indicative of the desirability of prophylactic intervention. 

In other embodiments, a patient which exhibits one or more of the 
20 following predisposing factors for malignancy is treated by administration of an 
effective amount of a Therapeutic: a chromosomal translocation associated with a 
malignancy (e.g., the Philadelphia chromosome for chronic myelogenous 
leukemia, t(14;18) for follicular lymphoma, etc.), familial polyposis or Gardner's 
syndrome (possible forerunners of colon cancer), benign monoclonal gammopathy 
25 (a possible forerunner of multiple myeloma), and a first degree kinship with 
persons having a cancer or precancerous disease showing a Mendelian (genetic) 
inheritance pattern (e.g., familial polyposis of the colon, Gardner's syndrome, 
hereditary exostosis, polyendocrine adenomatosis, medullary thyroid carcinoma 
with amyloid production and pheochromocytoma, Peutz-Jeghers syndrome, 
30 neurofibromatosis of Von Recklinghausen, retinoblastoma, carotid body tumor, 
cutaneous melanocarcinoma, intraocular melanocarcinoma, xeroderma 
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pigmentosum, ataxia telangiectasia, Chediak-Higashi syndrome, albm.sm, 
Fanconi's aplastic anemia, and Bloom's syndrome; see Robbins and Ange.l, 1976, 
Basic Pathology, 2d Ed., W.B. Saunders Co., Philadelphia, pp. 112-113) etc.) 

in another specific embodiment, an Antagonist Therapeutic of the 
5 invention is administered to a human patient to prevent progression to breast, 
colon, or cervical cancer. 

5.2.2. nT H FR DISORDERS 
In other embodiments, a Therapeutic of the invention can be 
10 administered to prevent a nervous system disorder described in Section 5.1.2, or 
other disorder liver cirrhosis, psoriasis, keloids, baldness) described m 
Section 5.1.3. 



15 53 DEMONSTRATION OF THERAPEUTIC 

np PROPHYLAC TIC UTILITY 

The Therapeutics of the invention can be tested in vivo for the 
desired therapeutic or prophylactic activity. For example, such compounds can 
be tested in suitable animal model systems prior to testing in humans, includmg 
but not limited to rats, mice, chicken, cows, monkeys, rabbits, etc. For in v,vo 
testing, prior to administration to humans, any animal model system known in the 
art may be used. 



20 



5 4 THERAPEUTIC/PROPHYLACTIC 

ADMINISTRAT ION AND COMPOSITIONS 

25 ^ inv ention provides methods of treatment (and prophylaxis) by 

administration to a subject of an effective amount of a Therapeutic of the 
invention. In a preferred aspect, the Therapeutic is substantially purified. The 
subject is preferably an animal, including but not limited to animals such as cows, 
pigs, chickens, etc., and is preferably a mammal, and most preferably human. 

30 various delivery systems are known and can be used to administer 

a Therapeutic of the invention, e.g., encapsulation in liposomes, microparticles, 
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microcapsules, expression by recombinant cells, receptor-mediated endocytosis 
(see, e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), construction of a 
Therapeutic nucleic acid as part of a retroviral or other vector, etc. Methods of 
introduction include but are not limited to intradermal, intramuscular, 
5 intraperitoneal, intravenous, subcutaneous, intranasal, and oral routes. The 
compounds may be administered by any convenient route, for example by 
infusion or bolus injection, by absorption through epithelial or mucocutaneous 
linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.) and may be 
administered together with other biologically active agents. Administration can be 
10 systemic or local. In addition, it may be desirable to introduce the 

pharmaceutical compositions of the invention into the central nervous system by 
any suitable route, including intraventricular and intrathecal injection; 
intraventricular injection may be facilitated by an intraventricular catheter, for 
example, attached to a reservoir, such as an Ommaya reservoir. 
15 in a specific embodiment, it may be desirable to administer the 

pharmaceutical compositions of the invention locally to the area in need of 
treatment; this may be achieved by, for example, and not by way of limitation, 
local infusion during surgery, topical application, e.g., in conjunction with a 
wound dressing after surgery, by injection, by means of a catheter, by means of a 
20 suppository, or by means of an implant, said implant being of a porous, non- 
porous, or gelatinous material, including membranes, such as sialasttc 
membranes, or fibers. In one embodiment, administration can be by direct 
injection at the site (or former site) of a malignant tumor or neoplastic or pre- 
neoplastic tissue. 

25 In a specific embodiment, administration of a Therapeutic into a 

Notch-expressing cell is accomplished by linkage of the Therapeutic to a Delta (or 
other toporythmic) protein or portion thereof capable of mediating binding to 
Notch. Contact of a Notch-expressing cell with the linked Therapeutic results in 
binding of the linked Therapeutic via its Delta portion to Notch on the surface of 

30 the cell, followed by uptake of the linked Therapeutic into the Notch-expressing 
cell. 
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In a specific embodiment wherein an analog of a Notch 
^cellular signal-transducing domain is employed as a Therapeutic, such that ,t 
can inhibit Notch sign, transduction, fte analog is preferably dehvered 
in trace,.u,ar,y (.,, by expression from a nuc.eic acid vector, or by Imfcage^ 
5 Delta protein capabie of binding to Notch foilowed by binding and mtemahzat.on, 

or bv receptor-mediated mechanisms). 

,„ a specific embodimem where the Therapeutic is a r,uc.e,c aad 
.coding a pa* Therapeutic, the »u=.eic acid can te administered in «. to 
promote expression of Us encoded protein, b, constructing it as par. of an 
N appropriate nuc,eic acid expression veer and administering it » 

Le.iu.ar, e. 8 ., b, use of a retro.ira, vector (see U.S. Paten, No. 4,980,286), 
or by direct injection, or by use of micropan.Ce bombardment (e.g. , a gene gun, 
Bio.is.ic, Dupont), or coating whh iipids or cei.-surface recess or transfectmg 
agents or by administering it in hnkage to a homeobox-.ike peptide whtch ,s 
15 known to enter the nucleus (see , S . . Jolio, e, a... .99. , Proc. Na„. Acad. Sc, 
USA 88 1864-1868), «c. Alternatively, a nucleic acid Therapeutic can be 
introduced ,n,race„u,arl y and incorporated within host ce.l DMA for expression, 

by homologous recombination. 

In specific embodiments directed to treatment or prevention of 
20 partial disorders, preferably the following forms of administration are used: 

rirfrrrrrl Form' " f Administration 
pisorder 

Cervical cancer To P ical 
Gastrointestinal cancer Oral; intravenous 

Inhaled; intravenous 

Lung cancer 

25 Leukemia Intraven0US; extracor P orea, 

Metastatic carcinomas Intravenous; oral 

Targeted; intravenous: intrathecal 
Brain cancer b 

. , . Oral; intravenous 

Liver cirrhosis 



Psoriasis 
30 Keloids 
Baldness 



Topical 
Topical 
Topical 
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Spinal cord injury 
Parkinson's disease 



Motor neuron disease 



Alzheimer's disease 



Targeted; intravenous; intrathecal 
Targeted; intravenous; intrathecal 
Targeted; intravenous; intrathecal 
Targeted; intravenous; intrathecal 



5 



The present invention also provides pharmaceutical compositions. 



Such compositions comprise a therapeutically effective amount of a Therapeutic, 
and a pharmaceutical^ acceptable carrier or excipient. Such a carrier includes 
but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, 

10 and combinations thereof. The carrier and composition can be sterile. The 
formulation should suit the mode of administration. 

The composition, if desired, can also contain minor amounts of 
wetting or emulsifying agents, or pH buffering agents. The composition can be a 
liquid solution, suspension, emulsion, tablet, pill, capsule, sustained release 

15 formulation, or powder. The composition can be formulated as a suppository, 
with traditional binders and carriers such as triglycerides. Oral formulation can 
include standard carriers such as pharmaceutical grades of mannitol, lactose, 
starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, 
etc. 

20 In a preferred embodiment, the composition is formulated in 

accordance with routine procedures as a pharmaceutical composition adapted for 
intravenous administration to human beings. Typically, compositions for 
intravenous administration are solutions in sterile isotonic aqueous buffer. Where 
necessary, the composition may also include a solubilizing agent and a local 

25 anesthetic such as lignocaine to ease pain at the site of the injection. Generally, 
the ingredients are supplied either separately or mixed together in unit dosage 
form, for example, as a dry lyophilized powder or water free concentrate in a 
hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the composition is to be administered by 

30 infusion, it can be dispensed with an infusion bottle containing sterile 

pharmaceutical grade water or saline. Where the composition is administered by 
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injection, an ampoule of sterile water for injection or saline can be provided so 
that the ingredients may be mixed prior to administration. 

The Therapeutics of the invention can be formulated as neutral or 
salt forms. Pharmaceutical* acceptable salts include those formed with free 
5 amino groups such as those derived from hydrochloric, phosphoric, acet.c, oxahc, 
tartaric acids, etc., and those formed with free carboxyl groups such as those 
derived from sodium, potassium, ammonium, calcium, ferric hydroxys, 
isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, procame, etc. 

The amount of the Therapeutic of the invention which w.ll be 
10 effective in the treatment of a particular disorder or condition will depend on the 
nature of the disorder or condition, and can be determined by standard c.inical 
techniques. In addition, in vitro assays may optionally be employed to help 
identify optimal dosage ranges. The precise dose to be employed in the 
formulation will also depend on the route of administration, and the seriousness of 
15 the disease or disorder, and should be decided according to the judgment of the 
practitioner and each patient's circumstances. However, suitable dosage ranges 
for intravenous administration are generally about 20-500 micrograms of acuve 
compound per kilogram body weight. Suitable dosage ranges for intranasal 
administration are generally about 0.01 pg/kg body weight to 1 mg/kg body 
20 weight. Effective doses may be extrapolated from dose-response curves denved 
from in vitro or animal model test systems. 

Suppositories generally contain active ingredient in the range of 
0.5% to 10% by weight; oral formulations preferably contain 10% to 95% active 
ingredient. 

25 The invention also provides a pharmaceutical pack or kit 

comprising one or more containers filed with one or more of the ingredients of 
the pharmaceutical compositions of the invention. Optionally associated with such 
container(s) can be a notice in the form prescribed by a governmental agency 
regulating the manufacture, use or sale of pharmaceuticals or biological products, 

30 which notice reflects approval by the agency of manufacture, use or sale for 
human administration. 
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5.5. ANTISENSE REGULATION OF NOTCH EXPRESSION 

The present invention provides the therapeutic or prophylactic use 
of nucleic acids of at least six nucleotides that are antisense to a gene or cDNA 
encoding Notch or a portion thereof. "Antisense" as used herein refers to a 
5 nucleic acid capable of hybridizing to a portion of a Notch RNA (preferably 
mRNA) by virtue of some sequence complementarity. Such antisense nucleic 
acids have utility as Antagonist Therapeutics of the invention, and can be used in 
the treatment or prevention of disorders as described supra in Section 5.1 and its 
subsections. 

10 The antisense nucleic acids of the invention can be oligonucleotides 

that are double-stranded or single-stranded, RNA or DNA or a modification or 
derivative thereof, which can be directly administered to a cell, or which can be 
produced intracellularly by transcription of exogenous, introduced sequences. 

In a specific embodiment, the Notch antisense nucleic acids 

15 provided by the instant invention can be used for the treatment of tumors or other 
disorders, the cells of which tumor type or disorder can be demonstrated (in vitro 
or in vivo) to express the Notch gene. Such demonstration can be by detection of 
Notch RNA or of Notch protein. 

The invention further provides pharmaceutical compositions 

20 comprising an effective amount of the Notch antisense nucleic acids of the 

invention in a pharmaceutical^ acceptable carrier, as described supra in Section 
5.4. Methods for treatment and prevention of disorders (such as those described 
in Sections 5.1 and 5.2) comprising administering the pharmaceutical 
compositions of the invention are also provided. 

25 i n another embodiment, the invention is directed to methods for 

inhibiting the expression of a Notch nucleic acid sequence in a prokaryotic or 
eukaryotic cell comprising providing the cell with an effective amount of a 
composition comprising an antisense Notch nucleic acid of the invention. 

In another embodiment, the identification of cells expressing 

30 functional Notch receptors can be carried out by observing the ability of Notch to 
"rescue" such cells from the cytotoxic effects of a Notch antisense nucleic acid. 
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In an alternative embodiment of the invention, nucleic acids 
antisense to a nucleic acid encoding a ("adhesive") toporythmic protein or 
fragment that binds to Notch, are envisioned as Therapeutics. 

Notch antisense nucleic acids and their uses are described in detail 



5 



below. 



5.5.1. NOTCH ANTISENSE NUCLEIC ACIDS 
The Notch antisense nucleic acids are of at least six nucleotides 



and are preferably oligonucleotides (ranging from 6 to about 50 oligonucleotides). 

10 In specific aspects, the oligonucleotide is at least 10 nucleotides, at least 15 
nucleotides, at least 100 nucleotides, or at least 200 nucleotides. The 
oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double- stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate 

15 backbone. The oligonucleotide may include other appending groups such as 
peptides, or agents facilitating transport across the cell membrane (see, e.g., 
Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et 
al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
WO 88/09810, published December 15, 1988) or blood-brain barrier (see, e.g., 

20 PCT Publication No. WO 89/10134, published April 25 ? 1988), hybridization- 
triggered cleavage agents (see, e.g., Krol et al., 1988, BioTechniques 6:958-976) 
or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549), 



oligonucleotide is provided, preferably of single-stranded DNA. In a most 
25 preferred aspect, such an oligonucleotide comprises a sequence antisense to the 
sequence encoding ELR 11 and ELR 12 of Notch, most preferably, of human 
Notch. The oligonucleotide may be modified at any position on its structure with 
substituents generally known in the art. 



30 modified base moiety which is selected from the group including but not limited 
to 5-fluorouracil, 5-bromouraciI, 5-chIorouracil, 5-iodouraciU hypoxanthine, 



In a preferred aspect of the invention, a Notch antisense 



The Notch antisense oligonucleotide may comprise at least one 
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xantine, 4-acetyIcytosine, 5-(carboxyhydroxylmethyI) uracil, 
5-carboxymethyIaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethyIguanine, 2-methyladenine, 
5 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 

7-methylguanine, 5-methylaminomethyluraciI, 5-methoxyaminomethyl- 

2- thiouracil, beta-D-mannosy Iqueosine, 5 '-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methyIthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyI-2-thiouracil, 

10 2-thiouraciI, 4-thiouracil, 5-methyluracih uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v) ? 5-methyl-2-thiouracil. 3-(3-amino-3-N-2- 
carboxy propyl) uracil, (acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide comprises at least one 
modified sugar moiety selected from the group including but not limited to 

15 arabinose, 2-fluoroarabinose, xylulose, and hexose. 

In yet another embodiment, the oligonucleotide comprises at least 
one modified phosphate backbone selected from the group consisting of a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 

20 phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the oligonucleotide is an a-anomeric 
oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded 
hybrids with complementary RNA in which, contrary to the usual jS-units, the 
strands run parallel to each other (Gautier et al., 1987, Nucl. Acids Res. 

25 15:6625-6641). 

The oligonucleotide may be conjugated to another molecule, e.g., 
a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

Oligonucleotides of the invention may be synthesized by standard 
30 methods known in the art, e.g. by use of an automated DNA synthesizer (such as 
are commercially available from Biosearch, Applied Biosystems, etc.). As 
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examples, phosphorothioate oligos may be synthesized by the method of Stein et 
al. (1988, Nucl. Acids Res. 16:3209), methylphosphonate oligos can be prepared 
by use of controlled pore glass polymer supports (Sarin et al., 1988, Proc. Natl. 
Acad. Sci. U.S.A. 85:7448-7451), etc. 
5 In a specific embodiment, the Notch antisense oligonucleotide 

comprises catalytic RNA, or a ribozyme (see, e.g., PCT International Publication 
WO 90/11364, published October 4, 1990; Sarver et al., 1990, Science 247:1222- 
1225). In another embodiment, the oligonucleotide is a 2 , -0-methylribonucleotide 
(Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
10 analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

In an alternative embodiment, the Notch antisense nucleic acid of 
the invention is produced intracellular^ by transcription from an exogenous 
sequence. For example, a vector can be introduced in vivo such that it is taken 
up by a cell, within which cell the vector or a portion thereof is transcribed, 
15 producing an antisense nucleic acid (RNA) of the invention. Such a vector would 
contain a sequence encoding the Notch antisense nucleic acid. Such a vector can 
remain episomal or become chromosomally integrated, as long as it can be 
transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. 
20 Vectors can be plasmid, viral, or others known in the art, used for replication and 
expression in mammalian, cells. Expression of the sequence encoding the Notch 
antisense RNA can be by any promoter known in the art to act in mammalian, 
preferably human, cells. Such promoters can be inducible or constitutive. Such 
promoters include but are not limited to: the SV40 early promoter region 
25 (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in 
the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., 1980, Cell 
22:787-797), the herpes thymidine kinase promoter (Wagner et al., 1981, Proc. 
Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metal lothionein gene (Brinster et al., 1982, Nature 296:39-42), etc. 
30 The antisense nucleic acids of the invention comprise a sequence 

complementary to at least a portion of an RNA transcript of a Notch gene, 
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preferably a human Notch gene. However, absolute complementarity, although 
preferred, is not required. A sequence "complementary to at least a portion of an 
RNA," as referred to herein, means a sequence having sufficient complementarity 
to be able to hybridize with the RNA, forming a stable duplex; in the case of 

5 double-stranded Notch antisense nucleic acids, a single strand of the duplex DNA 
may thus be tested, or triplex formation may be assayed. The ability to hybridize 
will depend on both the degree of complementarity and the length of the antisense 
nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base 
mismatches with a Notch RNA it may contain and still form a stable duplex (or 

10 triplex, as the case may be). One skilled in the art can ascertain a tolerable 

degree of mismatch by use of standard procedures to determine the melting point 
of the hybridized complex. 



IS 



20 



25 



30 



5 5 2. THERAPEUTIC UTILITY OF NOTCH 
ANTISENSE NUCLEIC ACIDS 

The Notch antisense nucleic acids can be used to treat (or prevent) 

malignancies, of a cell type which has been shown to express Notch RNA. 

Malignant, neoplastic, and pre-neoplastic cells which can be tested for such 

expression include but are not limited to those described supra in Sections 5.1.1 

and 5.2.1. In a preferred embodiment, a single-stranded DNA antisense Notch 

oligonucleotide is used. 

Malignant (particularly, tumor) cell types which express Notch 
RNA can be identified by various methods known in the art. Such methods 
include but are not limited to hybridization with a Notch-specific nucleic acid 
(e.g. by Northern hybridization, dot blot hybridization, in situ hybridization), 
observing the ability of RNA from the cell type to be translated in vitro into 
Notch, etc. In a preferred aspect, primary tumor tissue from a patient can be 
assayed for Notch expression prior to treatment. 

Pharmaceutical compositions of the invention (see Section 5.1,4), 
comprising an effective amount of a Notch antisense nucleic acid in a 
pharmaceutically acceptable carrier, can be administered to a patient having a 
malignancy which is of a type that expresses Notch RNA. 
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The amount of Notch antisense nucleic acid which will be effective 
in the treatment of a particular disorder or condition will depend on the nature of 
the disorder or condition, and can be determined by standard clinical techniques. 
Where possible, it is desirable to determine the antisense cytotoxicity of the tumor 
5 type to be treated in vitro, and then in useful animal model systems prior to 

testing and use in humans. 

In a specific embodiment, pharmaceutical compositions comprising 
Notch antisense nucleic acids are administered via liposomes, microparticles, or 
microcapsules. In various embodiments of the invention, it may be useful to use 
10 such compositions to achieve sustained release of the Notch antisense nucleic 

acids. In a specific embodiment, it may be desirable to utilize liposomes targeted 
via antibodies to specific identifiable tumor antigens (Leonetti et al., 1990, Proc. 
Natl. Acad. Sci. U.S.A. 87:2448-2451; Renneisen et al., 1990, J. Biol. Chem. 
265:16337-16342). 

15 

5.6. pi AGNOSTIC UTILITY 
Notch proteins, analogues, derivatives, and subsequences thereof, 
Notch nucleic acids (and sequences complementary thereto), anti-Notch 
antibodies, and other toporythmic proteins and derivatives and analogs thereof 

20 which interact with Notch proteins, and inhibitors of North-toporythmic protein 
interactions, have uses in diagnostics. Such molecules can be used in assays, 
such as immunoassays, to detect, prognose, diagnose, or monitor various 
conditions, diseases, and disorders affecting Notch expression, or monitor the 
treatment thereof. In particular, such an immunoassay is carried out by a method 

25 comprising contacting a sample derived from a patient with an anti-Notch 
antibody under conditions such that immunospecific binding can occur, and 
detecting or measuring the amount of any immunospecific binding by the 
antibody. In a specific embodiment, antibody to Notch can be used to assay in a 
patient tissue or serum sample for the presence of Notch where an aberrant level 

30 of Notch is an indication of a diseased condition. 
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The immunoassays which can be used include but are not limited 
to competitive and non-competitive assay systems using techniques such as 
western blots, radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoprecipitation assays, precipitin reactions, gel 
5 diffusion precipitin reactions, immunodiffusion assays, agglutination assays, 
complement-fixation assays, immunoradiometric assays, fluorescent 
immunoassays, protein A immunoassays, to name but a few. 

Notch genes and related nucleic acid sequences and subsequences, 
including complementary sequences, and other toporythmic gene sequences, can 
10 also be used in hybridization assays. Notch nucleic acid sequences, or 

subsequences thereof comprising about at least 8 nucleotides, can be used as 
hybridization probes. Hybridization assays can be used to detect, prognose, 
diagnose, or monitor conditions, disorders, or disease states associated with 
aberrant changes in Notch expression and/or activity as described supra. In 
15 particular, such a hybridization assay is carried out by a method comprising 

contacting a sample containing nucleic acid with a nucleic acid probe capable of 
hybridizing to Notch DNA or RNA, under conditions such that hybridization can 
occur, and detecting or measuring any resulting hybridization. 

As detailed in examples section 10.1 infra, increased Notch 
20 expression occurs in human breast, colon, and cervical cancer. Accordingly, in 
specific embodiments, human breast, colon, or cervical cancer or premalignant 
changes in such tissues is diagnosed by detecting increased Notch expression (or 
amount) in patient samples relative to the level of Notch expression (or amount) 
in an analogous non-malignant, or non-premalignant, as the case may be, sample 
25 (from the patient or another person, as determined experimentally or as is known 
as a standard level in such samples). 

In one embodiment, the Notch protein (or derivative having Notch 
antigenicity) that is detected or measured is on the cell surface. In another 
embodiment, the Notch protein (or derivative) is a cell free soluble molecule 
30 {e.g. , as measured in a blood or serum sample) or is intracellular. Without 

intending to be bound mechanistically, Applicants believe that cell free Notch may 
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result from secretion or shedding from the cell surface. In yet another 
embodiment, soluble, cell-surface, and intracellular amounts of Notch protein or 
derivative are detected or measured. 

5 5.7. NOTI-H NUCL EIC ACIDS 

Therapeutics of the invention which are Notch nucleic acids or 
Notch antisense nucleic acids, as well as nucleic acids encoding protein 
Therapeutics, include those described below, which can be obtained by methods 
known in the art, and in particular, as described below. 

10 In particular aspects, the invention provides amino acid sequences 

of Notch, preferably human Notch, and fragments and derivatives thereof which 
comprise an antigenic determinant (i.e., can be recognized by an antibody) or 
which are functionally active, as well as nucleic acid sequences encoding the 
foregoing. "Functionally active" material as used herein refers to that material 

15 displaying one or more known functional activities associated with the full-length 
(wild-type) Notch protein product, e.g., binding to Delta, binding to Serrate, 
binding to any other Notch ligand, antigenicity (binding to an anti-Notch 
antibody), etc. 

In specific embodiments, the invention provides fragments of a 
20 Notch protein consisting of at least 40 amino acids, or of at least 75 amino acids. 
In other embodiments, the proteins comprise or consist essentially of the 
intracellular domain, transmembrane region, extracellular domain, cdclO region, 
Notch /lin-12 repeats, or the EGF-homologous repeats, or any combination of the 
foregoing, of a Notch protein. Fragments, or proteins comprising fragments, 
25 lacking some or all of the EGF-homologous repeats of Notch are also provided. 
Nucleic acids encoding the foregoing are provided. 

In other specific embodiments, the invention provides nucleotide 
sequences and subsequences of Notch, preferably human Notch, consisting of at 
least 25 nucleotides, at least 50 nucleotides, or at least 150 nucleotides. Nucleic 
30 acids encoding the proteins and protein fragments described above are provided, 
as well as nucleic acids complementary to and capable of hybridizing to such 
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nucleic acids. In one embodiment, such a complementary sequence may be 
complementary to a Notch cDNA sequence of at least 25 nucleotides, or of at 
least 100 nucleotides. In a preferred aspect, the invention utilizes cDNA 
sequences encoding human Notch or a portion thereof. In a specific embodiment, 

5 such sequences of the human Notch gene or cDNA are as contained in plasmids 
hN3k, hN4k, or hN5k (see Section 9, infra) or in the gene corresponding thereto; 
such a human Notch protein sequence can be as shown in Figures 10 (SEQ ID 
NO: 11) or 11 (SEQ ID NO: 13). In other embodiments, the Notch nucleic acid 
and/or its encoded protein has at least a portion of the sequence shown in one of 

10 the following publications: Wharton et al., 1985, Cell 43:567-581 (Drosophila 
Notch); Kidd et al., 1986, Mol. Cell. Biol. 6:3094-3108 (Drosophila Notch); 
Coffman et al., 1990, Science 249:1438-1441 (Xenopus Notch); Ellisen et al., 
1991, Cell 66:649-661 (a human Notch). In another aspect, the sequences of 
human Notch are those encoding the human Notch amino acid sequences or a 

15 portion thereof as shown in Figure 13. In a particular aspect, the human Notch 
sequences are those of the hN homolog (represented in part by plasmid hN5k) or 

the TAN-1 homolog. 

In one embodiment of the invention, the invention is directed to 
the full-length human Notch protein encoded by the hN homolog as depicted in 
20 Figure 13, both containing the signal sequence (i.e. , the precursor protein; amino 
acids 1-2169) and lacking the signal sequence (i.c, the mature protein; amino 
acids -26-2169), as well as portions of the foregoing (e.g. , the extracellular 
domain, EGF homologous repeat region, EGF-like repeats 11 and 12, 
cdc-10/ankyrin repeats, etc.) and proteins comprising the foregoing, as well as 
25 nucleic acids encoding the foregoing. 

As is readily apparent, as used herein, a "nucleic acid encoding a 
fragment or portion of a Notch protein" shall be construed as referring to a 
nucleic acid encoding only the recited fragment or portion of the Notch protein 
and not other portions of the Notch protein. 
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In a preferred, but not limiting, aspect of the invention, a human 
Notch DNA sequence can be cloned and sequenced by the method described in 

Section 9, infra. 

In another preferred aspect, PCR is used to amplify the desired 
5 sequence in the library, prior to selection. For example, oligonucleotide primers 
representing part of the adhesive domains encoded by a homologue of the desired 
gene can be used as primers in PCR. 

The above-methods are not meant to limit the following general 
description of methods by which clones of Notch may be obtained. 
10 Any eukaryotic cell can potentially serve as the nucleic acid source 

for the molecular cloning of the Motch gene. The DNA may be obtained by 
standard procedures known in the art from cloned DNA (e.g., a DNA "library"), 
by chemical synthesis, by cDNA cloning, or by the cloning of genomic DNA, or 
fragments thereof, purified from the desired human cell (see, for example 
15 Sambrook et al. , 1989, Molecular Cloning, A Laboratory Manual, Cold Spring 
Harbor Laboratory, 2d. Ed., Cold Spring Harbor, New York; Glover, D.M. 
(ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., Oxford, 
U.K. Vol. I, H.) Clones derived from genomic DNA may contain regulatory and 
intron DNA regions in addition to coding regions; clones derived from cDNA 
20 will contain only exon sequences. Whatever the source, the gene should be 
molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA 
fragments are generated, some of which will encode the desired gene. The DNA 
may be cleaved at specific sites using various restriction enzymes. Alternatively, 
25 one may use DNAse in the presence of manganese to fragment the DNA, or the 
DNA can be physically sheared, as for example, by sonication. The linear DNA 
fragments can then be separated according to size by standard techniques, 
including but not limited to, agarose and polyacrylamide gel electrophoresis and 
column chromatography. 
30 Once the DNA fragments are generated, identification of the 

specific DNA fragment containing the desired gene may be accomplished in a 



35 



PCT/US93/09338 
WO 94/07474 ^ 

- 41 - 



number of ways. For example, if an amount of a portion of a Notch (of any 
species) gene or its specific RNA, or a fragment thereof e.g., the adhesive 
domain, is available and can be purified and labeled, the generated DNA 
fragments may be screened by nucleic acid hybridization to the labeled probe 
5 (Benton, W. and Davis, R., 1977, Science 196, 180; Grunstein, M. And 
Hogness, D., 1975, Proc. Natl. Acad. Sci. U.S.A. 72, 3961). Those DNA 
fragments with substantial homology to the probe will hybridize. It is also 
possible to identify the appropriate fragment by restriction enzyme digestion(s) 
and comparison of fragment sizes with those expected according to a known 
10 restriction map if such is available. Further selection can be carried out on the 
basis of the properties of the gene. Alternatively, the presence of the gene may 
be detected by assays based on the physical, chemical, or immunological 
properties of its expressed product. For example, cDNA clones, or DNA clones 
which hybrid-select the proper mRNAs, can be selected which produce a protein 
15 that, e.g. , has similar or identical electrophoretic migration, isolectric focusing 
behavior, proteolytic digestion maps, in vitro aggregation activity 
("adhesiveness") or antigenic properties as known for Notch. If an antibody to 
Notch is available, the Notch protein may be identified by binding of labeled 
antibody to the putatively Notch synthesizing clones, in an EL1SA (enzyme-linked 
20 immunosorbent assay)-type procedure. 

The Notch gene can also be identified by mRNA selection by 
nucleic acid hybridization followed by in vitro translation. In this procedure, 
fragments are used to isolate complementary mRNAs by hybridization. Such 
DNA fragments may represent available, purified NoM DNA of another species 
25 [e.g. , Drosophila). lmmunoprecipitation analysis or functional assays (e.g. , 

aggregation ability in vitro; see examples infra) of the in vitro translation products 
of the isolated products of the isolated mRNAs identifies the mRNA and, 
therefore, the complementary DNA fragments that contain the desired sequences. 
In addition, specific mRNAs may be selected by adsorption of polysomes isolated 
30 from cells to immobilized antibodies specifically directed against Notch or Delta 
protein. A radiolabeled Notch cDNA can be synthesized using the selected 
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mR NA (from the adsorbed polysomes) as a template. The radiolabeled mRNA 

or cDNA may then be used as a probe to identify the Notch DNA fragments from 

among other genomic DNA fragments. 

Alternatives to isolating the Notch genomic DNA include, but are 

not limited to. chemically synthesizing the gene sequence itself from a known 
sequence or making cDNA to the mRNA which encodes the Notch gene. For 
example, RNA for cDNA cloning of the Notch gene can be isolated from cells 
which express Notch. Other methods are possible and within the scope of the 



10 



invention. 

The identified and isolated gene can then be inserted into an 
appropriate cloning vector. A large number of vector-host systems known in the 
art may be used. Possible vectors include, but are not limited to, plasnuds or 
modified viruses, but the vector system must be compatible with the host cell 
used Such vectors include, but are not limited to, bacteriophages such as lambda 
15 derivatives, or plasmids such as PBR322 or P UC plasmid derivatives. The 

insertion into a Coning vector can, for example, be accomplished by ligating the 
DNA fragment into a cloning vector which has complementary cohesive termm,. 
However, if the complementary restriction sites used to fragment the DNA are 
not present in the cloning vector, the ends of the DNA molecules may be 
20 enzymatically modified. Alternative*, any site desired may be produced by 

Jigating nucleotide sequences (linkers) onto the DNA termini: these ligated hnkers 
may comprise specific chemically synthesized oligonucleotides encod.ng 
restriction endonuclease recognition sequences. In an alternative method, the 
cleaved vector and Notch or Delta gene may be modified by homopolymeric 
25 tailing Recombinant molecules can be introduced into host cells via 

transformation, transfection, infection, electroporation, etc., so that many cop.es 

of the gene sequence are generated. 

in sin alternative method, the desired gene may be identified and 
isolated after insertion into a suitable cloning vector in a "shot gun" approach. 
30 Enrichment for the desired gene, for example, by size fractionization, can be 
done before insertion into the cloning vector. 
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In specific embodiments, transformation of host cells with 
recombinant DNA molecules that incorporate the isolated Notch gene, cDNA, or 
synthesized DNA sequence enables generation of multiple copies of the gene. 
Thus, the gene may be obtained in large quantities by growing transformants, 

5 isolating the recombinant DNA molecules from the transformants and, when 
necessary, retrieving the inserted gene from the isolated recombinant DNA. 

The Notch sequences provided by the instant invention include 
those nucleotide sequences encoding substantially the same amino acid sequences 
as found in native Notch protein, and those encoded amino acid sequences with 

10 functionally equivalent amino acids, all as described in Section 5.6 infra for 

Notch derivatives. 

Similar methods to those described supra can be used to obtain a 
nucleic acid encoding Delta, Serrate, or adhesive portions thereof, or other 
toporythmic gene of interest. In a specific embodiment, the Delta nucleic acid 
IS has at least a portion of the sequence shown in Figure 1 (SEQ ID NO: l). In 

another specific embodiment, the Serrate nucleic acid has at least a portion of the 
sequence shown in Figure 5 (SEQ ID NO:3). The nucleic acid sequences 
encoding toporythmic proteins can be isolated from porcine, bovine, feline, avian, 
equine, or canine, as well as primate sources and any other species in which 
20 homologs of known toporythmic genes [including but not limited to the following 
genes (with the publication of sequences in parentheses): Delta (Vassin et al., 
1987, EMBO J. 6, 3431-3440; Kopczynski et al., 1988, Genes Dev. 2, 1723- 
1735; note corrections to the Kopczynski et al. sequence found in Figure 1 hereof 
(SEQ ID NO:l and SEQ ID NO:2)) and Serrate (Fleming et al., 1990, Genes & 
25 Dev. 4, 2188-2201)] can be identified. Such sequences can be altered by 
substitutions, additions or deletions that provide for functionally equivalent 
molecules, as described supra. 

5.8. RFPOMBINANT PRODUCTION OF PROTEIN THERAPEUTICS 
30 The nucleic acid coding for a protein Therapeutic of the invention 

can be inserted into an appropriate expression vector, i.e., a vector which 
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contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequence. The necessary transcriptional and translational signals 
can also be supplied by the native toporythmic gene and/or its flanking regions. 
A variety of host-vector systems may be utilized to express the protein-coding 
5 sequence. These include but are not limited to mammalian cell systems infected 
with virus {e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); microorganisms such as yeast containing yeast 
vectors, or bacteria transformed with bacteriophage, DNA, plasmid DNA, or 
cosmid DNA. The expression elements of vectors vary in their strengths and 
10 specificities. Depending on the host-vector system utilized, any one of a number 
of suitable transcription and translation elements may be used. In a specific 
embodiment, the adhesive portion of the Notch gene, e.g., that encoding EGF- 
like repeats (ELR) 11 and 12, is expressed. In other specific embodiments, the 
human Notch gene is expressed, or a sequence encoding a functionally active 

15 portion of human Notch. 

Any of the methods previously described for the insertion of DNA 
fragments into a vector may be used to construct expression vectors containing a 
chimeric gene consisting of appropriate transcriptional/translational control signals 
and the protein coding sequences. These methods may include in vitro 

20 recombinant DNA and synthetic techniques and in vivo recombinants (genetic 

recombination). Expression of nucleic acid sequence encoding a Notch protein or 
peptide fragment may be regulated by a second nucleic acid sequence so that the 
Notch protein or peptide is expressed in a host transformed with the recombinant 
DNA molecule. For example, expression of a Notch protein may be controlled 

25 by any promoter/enhancer element known in the art. Promoters which may be 
used to control toporythmic gene expression include, but are not limited to, the 
SV40 early promoter region (Bernoist and Chambon, 1981, Nature 290, 304- 
310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto, et al., 1980, Cell 22, 787-797), the herpes thymidine kinase 

30 promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78, 1441-1445), 
the regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 
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296, 39-42); prokaryotic expression vectors such as the /^-lactamase promoter 
(Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A. 75, 3727-3731), or 
the *2£ promoter (DeBoer, et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80, 21- 
25); see also "Useful proteins from recombinant bacteria" in Scientific American, 
5 1980, 242, 74-94; plant expression vectors comprising the nopaline synthetase 
promoter region (Herrera-Estrella et al., Nature 303, 209-213) or the cauliflower 
mosaic virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 9, 
2871), and the promoter of the photosynthetic enzyme ribulose biphosphate 
carboxylase (Herrera-Estrella et al., 1984, Nature 310, 115-120); promoter 
10 elements from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol 
dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, alkaline 
phosphatase promoter, and the following animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: 
elastase 1 gene control region which is active in pancreatic acinar cells (Swift et 
15 al., 1984, Cell 38, 639-646; Ornitz et al., 1986, Cold Spring Harbor Symp. 
Quant. Biol. 50, 399-409; MacDonald, 1987, Hepatology 7, 425-515); insulin 
gene control region which is active in pancreatic beta cells (Hanahan, 1985, 
Nature 315, 115-122), immunoglobulin gene control region which is active in 
lymphoid cells (Grosschedl et al., 1984, Cell 38, 647-658; Adames et al., 1985, 
20 Nature 318, 533-538; Alexander et al., 1987, Mol. Cell. Biol. 7, 1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, 
lymphoid and mast cells (Leder et al., 1986, Cell 45, 485-495), albumin gene 
control region which is active in liver (Pinkert et al., 1987, Genes and Devel. 1, 
268-276), alpha-fetoprotein gene control region which is active in liver (Krumlauf 
25 et al., 1985, Mol. Cell. Biol. 5, 1639-1648; Hammer et al., 1987, Science 235, 
53-58; alpha 1-antitrypsin gene control region which is active in the liver (Kelsey 
et al., 1987, Genes and Devel. 1, 161-171), beta-globin gene control region 
which is active in myeloid cells (Mogram et al., 1985, Nature 315, 338-340; 
Kollias et al., 1986, Cell 46, 89-94; myelin basic protein gene control region 
30 which is active in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 
48, 703-712); myosin light chain-2 gene control region which is active in skeletal 



35 



_ PCT/US93/09338 
WO 94/07474 ^ 

- 46 - 



muscle (Sani, 1985, Nature 314, 283-286), and gonadotropic releasing hormone 
gene control region which is active in the hypothalamus (Mason et ah, 1986. 
Science 234, 1372-1378). 

Expression vectors containing Notch gene inserts can be identified 
5 by three general approaches: (a) nucleic acid hybridization, (b) presence or 

absence of "marker" gene functions, and (c) expression of inserted sequences. In 
the first approach, the presence of a foreign gene inserted in an expression vector 
can be detected by nucleic acid hybridization using probes comprising sequences 
that are homologous to an inserted toporythmic gene. In the second approach, the 
10 recombinant vector/host system can be identified and selected based upon the 
presence or absence of certain "marker" gene functions {e.g., thymidine kinase 
activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of foreign genes in the 
vector. For example, if the Notch gene is inserted within the marker gene 
15 sequence of the vector, recombinants containing the Notch insert can be identified 
by the absence of the marker gene function. In the third approach, recombinant 
expression vectors can be identified by assaying the foreign gene product 
expressed by the recombinant. Such assays can be based, for example, on the 
physical or functional properties of the Notch gene product in ]n vUro assay 
20 systems, e.g.. aggregation (adhesive) ability (see Sections 6-7, infra). 

Once a particular recombinant DNA molecule is identified and 
isolated, several methods known in the art may be used to propagate it. Once a 
suitable host system and growth conditions are established, recombinant 
expression vectors can be propagated and prepared in quantity. As previously 
25 explained, the expression vectors which can be used include, but are not limited 
to, the following vectors or their derivatives: human or animal viruses such as 
vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast vectors; 
bacteriophage vectors (e.g., lambda), and plasmid and cosmid DNA vectors, to 
name but a few. 

30 In addition, a host cell strain may be chosen which modulates the 

expression of the inserted sequences, or modifies and processes the gene product 
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in the specific fashion desired. Expression from certain promoters can be 
elevated in the presence of certain inducers; thus, expression of the genetically 
engineered Notch protein may be controlled. Furthermore, different host cells 
have characteristic and specific mechanisms for the translational and post- 
5 translational processing and modification (e.g., glycosylation, cleavage) of 
proteins. Appropriate cell lines or host systems can be chosen to ensure the 
desired modification and processing of the foreign protein expressed. For 
example, expression in a bacterial system can be used to produce an 
unglycosylated core protein product. Expression in yeast will produce a 
10 glycosylated product. Expression in mammalian cells can be used to ensure 
"native" glycosylation of a heterologous mammalian toporythmic protein. 
Furthermore, different vector/host expression systems may effect processing 
reactions such as proteolytic cleavages to different extents. 

In other specific embodiments, the Notch protein, fragment, 
IS analog, or derivative may be expressed as a fusion, or chimeric protein product 
(comprising the protein, fragment, analog, or derivative joined via a peptide bond 
to a heterologous protein sequence (of a different protein)). Such a chimeric 
product can be made by ligating the appropriate nucleic acid sequences encoding 
the desired amino acid sequences to each other by methods known in the art, in 
20 the proper coding frame, and expressing the chimeric product by methods 

commonly known in the art. Alternatively, such a chimeric product may be made 
by protein synthetic techniques, e.g., by use of a peptide synthesizer. 

Both cDNA and genomic sequences can be cloned and expressed. 
In other embodiments, a Notch cDNA sequence may be 
25 chromosomally integrated and expressed. Homologous recombination procedures 
known in the art may be used. 



5 8.1. IDENTIFICATION AND PURIFICATION 
OF THE EXPRESSED GENE PRODUCT 

30 Once a recombinant which expresses the Notch gene sequence is 

identified, the gene product may be analyzed. This can be achieved by assays 
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based on the physical or functional properties of the product, including 
radioactive labelling of the product followed by analysis by gel electrophoresis. 

Once the Notch protein is identified, it may be isolated and 
purified by standard methods including chromatography (e.g., ion exchange, 
5 affinity, and sizing column chromatography), centrifugation, differential 

solubility, or by any other standard technique for the purification of proteins. 
The functional properties may be evaluated using any suitable assay, including, 
but not limited to, aggregation assays (see Sections 6-7). 

10 5.9. DERIVATIVES AND ANALOGS OF NOTCH 

AND OTHER TQPQRYTHMIC PROTEINS 

The invention further provides, as Therapeutics, derivatives 

(including but not limited to fragments) and analogs of Notch proteins. Also 

provided as Therapeutics are other toporythmic proteins and derivatives and 

15 analogs thereof, or Notch ligands, in particular, which promote or, alternatively, 
inhibit the interactions of such other toporythmic proteins with Notch. 

The production and use of derivatives and analogs related to Notch 
are within the scope of the present invention. In a specific embodiment, the 
derivative or analog is functionally active, i.e., capable of exhibiting one or more 

2q functional activities associated with a full-length, wild-type Notch protein. As 
one example, such derivatives or analogs which have the desired antigenicity can 
be used, for example, in diagnostic immunoassays as described in Section 5.3. 
Molecules which retain, or alternatively inhibit, a desired Notch property, e.g., 
binding to Delta or other toporythmic proteins, binding to a intracellular ligand, 

25 can be used therapeutically as inducers, or inhibitors, respectively, of such 

property and its physiological correlates. Derivatives or analogs of Notch can be 
tested for the desired activity by procedures known in the art, including but not 
limited to the assays described infra. In one specific embodiment, peptide 
libraries can be screened to select a peptide with the desired activity; such 

30 screening can be carried out by assaying, e.g., for binding to Notch or a Notch 
binding partner such as Delta. 
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In particular, Notch derivatives can be made by altering Notch 
sequences by substitutions, additions or deletions that provide for functionally 
equivalent molecules. Due to the degeneracy of nucleotide coding sequences, 
other DNA sequences which encode substantially the same amino acid sequence 

5 as a Notch gene may be used in the practice of the present invention. These 
include but are not limited to nucleotide sequences comprising all or portions of 
Notch genes which are altered by the substitution of different codons that encode 
a functionally equivalent amino acid residue within the sequence, thus producing a 
silent change. Likewise, the Notch derivatives of the invention include, but are 

10 not limited to, those containing, as a primary amino acid sequence, all or part of 
the amino acid sequence of a Notch protein including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within the 
sequence resulting in a silent change. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of a similar 

15 polarity which acts as a functional equivalent, resulting in a silent alteration. 
Substitutes for an amino acid within the sequence may be selected from other 
members of the class to which the amino acid belongs. For example, the 
nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids 

20 include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. 
The positively charged (basic) amino acids include arginine, lysine and histidine. 
The negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. 

Derivatives or analogs of Notch include but are not limited to 
25 those peptides which are substantially homologous to Notch or fragments thereof, 
or whose encoding nucleic acid is capable of hybridizing to a Notch nucleic acid 
sequence. 

The Notch derivatives and analogs of the invention can be 
produced by various methods known in the art. The manipulations which result 
30 in their production can occur at the gene or protein level. For example, the 
cloned Notch gene sequence can be modified by any of numerous strategies 



35 



_ PCT/US93/09338 
WO 94/07474 $Q 



known in the art (Maniatis, T., 1989. Molecular Cloning, A Laboratory Manual, 
2d ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). The 
sequence can be cleaved at appropriate sites with restriction endonuclease(s), 
followed by further enzymatic modification if desired, isolated, and ligated in 
5 vitro. In the production of the gene encoding a derivative or analog of Notch, 
care should be taken to ensure that the modified gene remains within the same 
translational reading frame as Notch, uninterrupted by translation^ stop signals, 
in the gene region where the desired Notch activity is encoded. 

Additionally, the Notch-encoding nucleic acid sequence can be 
10 mutated in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or form new 
restriction endonuclease sites or destroy preexisting ones, to facilitate further in 
vitro modification. Any technique for mutagenesis known in the art can be used, 
including but not limited to, in vitro site-directed mutagenesis (Hutchinson, C, et 
15 al., 1978, J. Biol. Chem 253:6551), use of TAB® linkers (Pharmacia), etc. 

Manipulations of the Notch sequence may also be made at the 
protein level. Included within the scope of the invention are Notch protein 
fragments or other derivatives or analogs which are differentially modified during 
or after translation, e.g., by glycosylation, acetylation, phosphorylation, 
20 amidation, derivatization by known protecting/blocking groups, proteolytic 

cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of 
numerous chemical modifications may be carried out by known techniques, 
including but not limited to specific chemical cleavage by cyanogen bromide, 
trypsin, chymotrypsin, papain, V8 protease, NaBH<; acetylation, formylation. 
25 oxidation, reduction; metabolic synthesis in the presence of tunicamycin; etc. 

In addition, analogs and derivatives of Notch can be chemically 
synthesized. For example, a peptide corresponding to a portion of a Notch 
protein which comprises the desired domain, or which mediates the desired 
aggregation activity in vitro, or binding to a receptor, can be synthesized by use 
30 of a peptide synthesizer. Furthermore, if desired, nonclassical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into 



35 



WO 94/07474 



- 51 - 



PCT/US93/09338 



the Notch sequence. Non-classical amino acids include but are not limited to the 
D-isomers of the common amino acids, a-amino isobutyric acid, 4-aminobutyric 
acid, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, 
t-butylalanine, phenylglycine, cyclohexylalanine, /3-alanine, designer amino acids 
5 such as /J-methyl amino acids, Ca-methyl amino acids, and Na-methyl amino 
acids. 

In a specific embodiment, the Notch derivative is a chimeric, or 
fusion, protein comprising a Notch protein or fragment thereof fused via a peptide 
bond at its amino- and/or carboxy-terminus to a non-Notch amino acid sequence. 

10 In one embodiment, such a chimeric protein is produced by recombinant 

expression of a nucleic acid encoding the protein (comprising a Notch-coding 
sequence joined in-frame to a non-Notch coding sequence). Such a chimeric 
product can be made by ligating the appropriate nucleic acid sequences encoding 
the desired amino acid sequences to each other by methods known in the art, in 

IS the proper coding frame, and expressing the chimeric product by methods 

commonly known in the art. Alternatively, such a chimeric product may be made 
by protein synthetic techniques, e.g., by use of a peptide synthesizer. In a 
specific embodiment, a chimeric nucleic acid encoding a mature Notch protein 
with a heterologous signal sequence is expressed such that the chimeric protein is 

20 expressed and processed by the cell to the mature Notch protein. As another 
example, and not by way of limitation, a recombinant molecule can be 
constructed according to the invention, comprising coding portions of both Notch 
and another toporythmic gene, e.g., Delta. The encoded protein of such a 
recombinant molecule could exhibit properties associated with both Notch and 

25 Delta and portray a novel profile of biological activities, including agonists as 
well as antagonists. The primary sequence of Notch and Delta may also be used 
to predict tertiary structure of the molecules using computer simulation (Hopp and 
Woods, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828); Notch/Delta 
chimeric recombinant genes could be designed in light of correlations between 

30 tertiary structure and biological function. Likewise, chimeric genes comprising 
portions of Notch fused to any heterologous (non-Notch) protein-encoding 
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sequences may be constructed. A specific embodiment relates to a chimeric 
protein comprising a fragment of Notch of at least six amino acids. 

In another specific embodiment, the Notch derivative is a fragment 
of Notch comprising a region of homology with another toporythmic protein. As 

5 used herein, a region of a first protein shall be considered "homologous" to a 
second protein when the amino acid sequence of the region is at least 30% 
identical or at least 75% either identical or involving conservative changes, when 
compared to any sequence in the second protein of an equal number of amino 
acids as the number contained in the region. 

10 Derivatives of Serrate, Delta, other toporythmic proteins, and the 

adhesive portions thereof, can be made by methods similar to those described 
supra. 
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5 9 1 DERIVATIVES OF NOTCH CONTAINING 

DNF. OR MORE DOMAINS OF TH E PROTEIN 

In a specific embodiment, the invention provides Therapeutics that 
are Notch derivatives and analogs, in particular Notch fragments and derivatives 
of such fragments, that comprise one or more domains of the Notch protein, 
including but not limited to the extracellular domain, transmembrane domain, 
intracellular domain, membrane-associated region, one or more of the EGF-like 
repeats (ELR) of the Notch protein, the cdclO repeats, and the JNotch/lin-12 
repeats. In specific embodiments, the Notch derivative may lack all or a portion 
of the ELRs, or one or more other regions of the protein. 

In a specific embodiment, relating to a Notch protein of a species 
other than D. melanogaster, preferably human, fragments comprising specific 
portions of Notch are those comprising portions in the respective Notch protein 
most homologous to specific fragments of the Drosophila Notch protein {e.g., 
ELR 11 and ELR 12). 
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5 9 2. DERIVATIVES OF NOTCH OR OTHER 

TOPORYTHMIC PROTEINS THAT MEDIATE 
BINDING TO TOPORYTHMIC PROTEIN 
DOMAINS. AND INHIBITO RS THEREOF 

The invention also provides Notch fragments, and analogs or 
5 derivatives of such fragments, which mediate binding to toporythmic proteins (and 
thus are termed herein "adhesive"), and nucleic acid sequences encoding the 
foregoing. 

Also included as Therapeutics of the invention are toporythmic 
(e.g., Delta, Serrate) protein fragments, and analogs or derivatives thereof, which 
10 mediate heterotypic binding to Notch (and thus are termed herein "adhesive"), 
and nucleic acid sequences relating to the foregoing. 

Also included as Therapeutics of the invention are inhibitors (e.g. , 
peptide inhibitors) of the foregoing toporythmic protein interactions with Notch. 

The ability to bind to a toporythmic protein can be demonstrated 
15 by in vitro aggregation assays with cells expressing such a toporythmic protein as 
well as cells expressing Notch or a Notch derivative (See Section 6). That is, the 
ability of a protein fragment to bind to a Notch protein can be demonstrated by 
detecting the ability of the fragment, when expressed on the surface of a first ceil, 
to bind to a Notch protein expressed on the surface of a second cell. Inhibitors of 
20 the foregoing interactions can be detected by their ability to inhibit such 

aggregation in vitro. 

The nucleic acid sequences encoding toporythmic proteins or 
adhesive domains thereof, for use in such assays, can be isolated from human, 
porcine, bovine, feline, avian, equine, canine, or insect, as well as primate 
25 sources and any other species in which homologs of known toporythmic genes 

can be identified. 

In a specific embodiment, the adhesive fragment of Notch is that 
comprising the portion of Notch most homologous to ELR 1 1 and 12, i.e., amino 
acid numbers 447 through 527 (SEQ ID NO: 14) of the Drosophila Notch 
30 sequence (see Figure 4). In yet another specific embodiment, the adhesive 

fragment of Delta mediating binding to Notch is that comprising the portion of 
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Delta most homologous to about amino acid numbers 1-230 of the Drosophila 
Delta sequence (SEQ ID NO:2). In a specific embodiment relating to an adhesive 
fragment of Serrate, such fragment is that comprising the portion of Serrate most 
homologous to about amino acid numbers 85-283 or 79-282 of the Drosophila 
5 Serrate sequence (see Figure 5 (SEQ ID NO:4)). 

Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences which encode substantially the same amino acid sequence as the 
adhesive sequences may be used in the practice of the present invention. These 
include but are not limited to nucleotide sequences comprising all or portions of 
10 the Notch . Delta , or Serrate genes which are altered by the substitution of 

different codons that encode a functionally equivalent amino acid residue within 
the sequence, thus producing a silent change. Likewise, the adhesive protein 
fragments or derivatives thereof, of the invention include, but are not limited to, 
those containing, as a primary amino acid sequence, all or part of the amino acid 
15 sequence of the adhesive domains including altered sequences in which 

functionally equivalent amino acid residues are substituted for residues within the 
sequence resulting in a silent change. 

Adhesive fragments of toporythmic proteins and potential 
derivatives, analogs or peptides related to adhesive toporythmic protein 
20 sequences, can be tested for the desired binding activity e.g. , by the in vitro 
aggregation assays described in the examples herein. Adhesive derivatives or 
adhesive analogs of adhesive fragments of toporythmic proteins include but are 
not limited to those peptides which are substantially homologous to the adhesive 
fragments, or whose encoding nucleic acid is capable of hybridizing to the nucleic 
25 acid sequence encoding the adhesive fragments, and which peptides and peptide 
analogs have positive binding activity e.g., as tested in vitro by an aggregation 
assay such as described in the examples sections infra. Such derivatives and 
analogs are envisioned as Therapeutics and are within the scope of the present 
invention. 

30 The adhesive-protein related derivatives, analogs, and peptides of 

the invention can be produced by various methods known in the art. The 
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manipulations which result in their production can occur at the gene or protein 
level (see Section 5.6). 

Additionally, the adhesive-encoding nucleic acid sequence can be 
mutated in vitro or in vivo; and manipulations of the adhesive sequence may also 
be made at the protein level (see Section 5,6). 

In addition, analogs and peptides related to adhesive fragments can 
be chemically synthesized. 

5.10. ASSAYS OF NOTCH PROTEINS, 
DERIVATIVES AND ANALOGS 

The in vitro activity of Notch proteins, derivatives and analogs, 

and other topprythmic proteins which bind to Notch, can be assayed by various 

methods. 

For example, in one embodiment, where one is assaying for the 
ability to bind or compete with wild-type Notch for binding to anti-Notch 
antibody, various immunoassays known in the art can be used, including but not 
limited to competitive and non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or 
radioisotope labels, for example), western blots, precipitation reactions, 
agglutination assays (e.g., gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, protein A assays, and 
Immunoelectrophoresis assays, etc. In one embodiment, antibody binding is 
detected by detecting a label on the primary antibody. In another embodiment, 
the primary antibody is detected by detecting binding of a secondary antibody or 
reagent to the primary antibody. In a further embodiment, the secondary, 
antibody is labelled. Many means are known in the art for detecting binding in 
an immunoassay and are within the scope of the present invention. 

In another embodiment, where one is assaying for the ability to 
mediate binding to Notch, one can carry out an in vitro aggregation assay such as 
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described infra in Section 6 or 7 (see also Fehon et al., 1990, Cell 61:523-534; 

Rebay et al., 1991, Cell 67:687-699). 

In another embodiment, where another ligand for Notch is 

identified, ligand binding can be assayed, e.g., by binding assays well known in 
5 the art. In another embodiment, physiological correlates of ligand binding to cells 

expressing a Notch receptor (signal transduction) can be assayed. 

In another embodiment, in insect or other model systems, genetic 

studies can be done to study the phenotypic effect of a Notch mutant that is a 

derivative or analog of wild-type Notch. 
10 Other methods will be known to the skilled artisan and are within 

the scope of the invention. 

5.11. ANTIBODIES TO NOTCH PROTEINS, 

DFR TVATIVF ^ AND ANALOGS 

According to one embodiment of the invention, antibodies and 
fragments containing, the binding domain thereof, directed against Notch are 
Therapeutics. Accordingly, Notch proteins, fragments or analogs or derivatives 
thereof, in particular, human Notch proteins or fragments thereof, may be used as 
immunogens to generate anti-Notch protein antibodies. Such antibodies can be 
polyclonal, monoclonal, chimeric, single chain, Fab fragments, or from an Fab 
expression library. In a specific embodiment, antibodies specific to EGF-like 
repeats 11 and 12 of Notch may be prepared. In other embodiments, antibodies 
reactive with the extracellular domain of Notch can be generated. One example 
of such antibodies may prevent aggregation in an in vitro assay. In another 
25 embodiment, antibodies specific to human Notch are produced. 

Various procedures known in the art may be used for the 
production of polyclonal antibodies to a Notch protein or peptide. In a particular 
embodiment, rabbit polyclonal antibodies to an epitope of the human Notch 
protein encoded by a sequence depicted in Figure 10 or 11, or a subsequence 
30 thereof, can be obtained. For the production of antibody, various host animals 
can be immunized by injection with the native Notch protein, or a synthetic 
version, or fragment thereof, including but not limited to rabbits, mice, rats, etc. 
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Various adjuvants may be used to increase the immunological response, 
depending on the host species, and including but not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
5 emulsions, keyhold limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and corynebacterium 
parvum. 

For preparation of monoclonal antibodies directed toward a Notch 
protein sequence, any technique which provides for the production of antibody 
10 molecules by continuous cell lines in culture may be used. For example, the 
hybridoma technique originally developed by Kohler and Milstein (1975, Nature 
256, 495-497), as well as the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al., 1983, Immunology Today 4, 72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, 
15 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Antibody fragments which contain the idiotype (binding domain) of 
the molecule can be generated by known techniques. For example, such fragments 
include but are not limited to: the F(ab') 2 fragment which can be produced by 
pepsin digestion of the antibody molecule; the Fab' fragments which can be 
20 generated by reducing the disulfide bridges of the F(ab') 2 fragment, and the Fab 
fragments which can be generated by treating the antibody molecule with papain 

and a reducing agent. 

In the production of antibodies, screening for the desired antibody 
can be accomplished by techniques known in the art, e.g. EL1SA (enzyme-linked 

25 immunosorbent assay). For example, to select antibodies which recognize the 
adhesive domain of a Notch protein, one may assay generated hybridomas for a 
product which binds to a protein fragment containing such domain. For selection 
of an antibody specific to human Notch, one can select on the basis of positive 
binding to human Notch and a lack of binding to Drosophila Notch. 

30 In addition to therapeutic utility, the foregoing antibodies have 

utility in diagnostic immunoassays as described in Section 5.6 supra. 
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Similar procedures to those described supra can be used to make 
Therapeutics which are antibodies to domains of other proteins (particularly 
toporythmic proteins) that bind or otherwise interact with Notch {e.g., adhesive 
fragments of Delta or Serrate). 

5 

6. DOMAINS OF NOTCH MEDIATE 
RINDING WITH DELTA 

Intermolecular association between the products of the Notch and 

Delta genes was detected by studying the effects of their expression on 

, 0 aggregation in Drosophila Schneider's 2 (S2) cells (Fehon et al. , 1990, Cell 61 , 
523-534). Direct evidence of intermolecular interactions between Notch and 
Delta is described herein, as well as an assay system that can be used in 
dissecting the components of this interaction. Normally nonadhesive Drosophila 
S2 cultured cells that express Notch bind specifically in a calcium-dependent 

15 manner to cells that express Delta. Furthermore, while cells that express Notch 
do not bind to one another, cells that express Delta do bind to one another, 
suggesting that Notch and Delta can compete for binding to Delta at the cell 
surface. Notch and Delta form detergent-soluble complexes both in cultured cells 
and embryonic cells, suggesting that Notch and Delta interact directly at the 

20 molecular level in vitro and in vivo. The analyses suggest that Notch and Delta 
proteins interact at the cell surface via their extracellular domains. 

6.1. EXPERIMENTAL PROCEDURES 

f. i 1 FYPRFSSION CONSTRUCTS 
Expression constructs are described in Fehon et al., 1990, Cell 
61:523-534. Briefly, Notch encoded by the Mglla minigene a cDNA/genomic 
chimeric construct (Ramos et al., 1989, Genetics 123, 337-348) was expressed 
following insertion into pRmHa-3 (Bunch, et al., 1988, Nucl. Acids Res. 16, 
3Q 1043-1061). In the resulting construct, designated pMtNMg, the metallothionein 
promoter in pRmHa-3 is fused to Notch sequences starting 20 nucleotides 
upstream of the translation start site. 
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The extracellular Notch construct (ECN1), was derived from a 
Notch cosmid (Ramos et al., 1989, Genetics 123, 337-348), and has an internal 
deletion of the Notch coding sequences from amino acids 1790 to 2625 inclusive 
(Wharton et al., 1985, Cell 43, 567-581), and a predicted frameshift that 
5 produces a novel 59 amino acid carboxyl terminus. 

For the Delta expression construct, the Dll cDNA (Kopczynski et 
al., 1988, Genes Dev. 2, 1723-1735; Figure 1; SEQ ID NO:l), which includes 
the complete coding capacity for Delta, was inserted into the EcoRI site of 
pRmHa-3. This construct was called pMTDll. 

10 

6.1.2. ANTIBODY PREPARATION 
Hybridoma ceil line C17.9C6 was obtained from a mouse 
immunized with a fusion protein based on a 2.1 kb Sall-Hindlll fragment that 
includes coding sequences for most of the intracellular domain of Notch (amino 

15 acids 1791-2504; Wharton et al., 1985, Cell 43, 567-581). The fragment was 
subcloned into pUR289 (Ruther and Muller-Hill, 1983, EMBO J. 2, 1791-1794), 
and then transferred into the pATH 1 expression vector (Dieckmann and 
Tzagoloff, 1985, J. Biol. Chem. 260, 1513-1520) as a Bglll-Hindlll fragment. 
Soluble fusion protein was expressed, precipitated by 25% (NH 4 ) 2 S0 4 , 

20 resuspended in 6 M urea, and purified by preparative isoelectric focusing using a 
Rotofor (Bio-Rad) (for details, see Fehon, 1989, Rotofor Review No. 7, Bulletin 
1518, Richmond, California: Bio-Rad Laboratories). 

Mouse polyclonal antisera were raised against the extracellular 
domain of Notch using four BstYl fragments of 0.8 kb (amino acids 237-501: 

25 Wharton et al., 1985, Cell 43, 567-581), 1.1 kb (amino acids 501-868), 0.99 kb 
(amino acids 868-1200), and 1.4 kb (amino acids 1465-1935) length, which 
spanned from the fifth EGF-like repeat across the transmembrane domain, singly 
inserted in-frame into the appropriate pGEX expression vector (Smith and 
Johnson, 1988, Gene 67, 31-40). Fusion proteins were purified on glutathione- 

30 agarose beads (SIGMA). Mouse and rat antisera were precipitated with 50% 
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(NH 4 ) 2 S0 4 and resuspended in PBS (150 mM NaCI. 14 mM Na 2 HP0 4 , 6 mM 

NaH 2 P0 4 ) with 0.02% NaN 3 . 

Hybridoma cell line 201 was obtained from a mouse immunized 
with a fusion protein that includes coding sequences from the extracellular domain 
5 of Delta (Kopczynski et al.. 1988, Genes Dev. 2, 1723-1735), including 

sequences extending from the fourth through the ninth EGF-like repeats in Delta 

(amino acids 350-529). 

Rat polyclonal antisera were obtained following immunization with 

antigen derived from the same fusion protein construct. In this case, fusion 

10 protein was prepared by lysis of IPTG-induced cells in SDS-Laemmli buffer 

(Carroll and Laughon, 1987, in DNA Cloning, Volume III, D.M. Glover, ed. 

(Oxford: IRL Press), pp. 89-111), separation of proteins by SDS-PAGE, excision 

of the appropriate band from the gel, and electroelution of antigen from the gel 

slice for use in immunization (Harlow and Lane, 1988, Antibodies: A Laboratory 

15 Manual (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory)). 

6.1.3. m.l. (MILTURP TR ANSFECTION 
The S2 cell line (Schneider, 1972, J. Embryol. Exp. Morph. 27, 
353-365) was grown in M3 medium (prepared by Hazleton Co.) supplemented 
20 with 2.5 mg/ml Bacto-Peptone (Difco), 1 mg/ml TC Yeastolate (Difco), 11 % 
heat-inactivated fetal calf serum (FCS) (Hyclone). and 100 U/ml penicillin-100 
M g/ml streptomycin-0.25 Mg/ml fungizone (Hazleton). Cells growing in log phase 
at -2 x 10 6 cells/ml were transfected with 20 n of DNA-calcium phosphate 
coprecipitate in 1 ml per 5 ml of culture as previously described (Wigler et al., 
25 1979, Proc. Natl. Acad. Sci. USA 78, 1373-1376), with the exception that BES 
buffer (SIGMA) was used in place of HEPES buffer (Chen and Okayama, 1987, 
Mol. Cell. Biol. 7, 2745-2752). After 16-18 hr, cells were transferred to conical 
centrifuge tubes, pelleted in a clinical centrifuge at full speed for 30 seconds, 
rinsed once with 1/4 volume of fresh complete medium, resuspended in their 
30 original volume of complete medium, and returned to the original flask. 
Transfected cells were then allowed to recover for 24 hr before induction. 



35 



PCT/US93/09338 
WO 94/07474 

- 61 - 



6.1.4. ftfifiRPGATlO N ASSAYS 
Expression of the Notch and Delta metallothionein constructs was 
induced by the addition of CuS0 4 to 0.7 mM. Cells transfected with the ECN1 
construct were treated similarly. Cells were then mixed, incubated under 
5 aggregation conditions, and scored for their ability to aggregate using specific 
antisera and immunofluorescence microscopy to visualize expressing cells. 

Two types of aggregation assays were used. In the first assay, a 
total of 3 ml of cells (5-10 x 10* cells/ml) was placed in a 25 ml Erlenmeyer flask 
and rotated at 40-50 rpm on a rotary shaker for 24-48 hr at room temperature. 
10 For these experiments, cells were mixed 1-4 hr after induction began and 

induction was continued throughout the aggregation period. In the second assay, 
-0.6 ml of cells were placed in a 0.6 ml Eppendorf tube (leaving a small bubble) 
after an overnight induction (12-16 hr) at room temperature and rocked gently for 
1-2 hr at 4°C. The antibody inhibition and Ca 2+ dependence experiments were 
15 performed using the latter assay. For Ca 2+ dependence experiments, cells were 
first collected and rinsed in balanced saline solution (BSS) with 11% FCS (BSS- 
FCS; FCS was dialyzed against 0.9% NaCl, 5mM Tris [ P H 7.5]) or in Ca 2+ free 
BSS-FCS containing 10 mM EGTA (Snow et al., 1989, Cell 59, 313-323) and 
then resuspended in the same medium at the original volume. For the antibody 
20 inhibition experiments, NQtch-transfected cells were collected and rinsed in M3 
medium and then treated before aggregation in M3 medium for 1 hr at 4°C with a 
1:250 dilution of immune or preimmune sera from each of the four mice 
immunized with fusion proteins containing segments from the extracellular 
domain of Notch (see Antibody Preparation above). 

25 

6.1.5. TMMl TNOFLUO RF-SCENCE 
Cells were collected by centrifugation (3000 rpm for 20 seconds in 
an Eppendorf microcentrifuge) and fixed in 0.6 ml Eppendorf tubes with 0.5 mi 
of freshly made 2% paraformaldehyde in PBS for 10 min at room temperature. 
30 After fixing, cells were collected by centrifugation, rinsed twice in PBS, and 

stained for 1 hr in primary antibody in PBS with 0.1% saponin (SIGMA) and 1% 
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normal goat serum (Pocono Rabbit Farm, Canadensis, PA). Monoclonal antibody 
supernatants were diluted 1:10 and mouse or rat sera were diluted 1:1000 for this 
step. Cells were then rinsed once in PBS and stained for 1 hr in specific 
secondary antibodies (double-labeling grade goat anti-mouse and goat anti-rat, 

5 Jackson lmmunoresearch) in PBS-saponin-normal goat serum. After this 
incubation, cells were rinsed twice in PBS and mounted on slides in 90% 
glycerol, 10% 1 M Tris (pH 8.0), and 0.5% n-propyl gallate. Cells were viewed 
under epifluorescence on a Leitz Orthoplan 2 microscope. 

Confocal micrographs were taken using the Bio-Rad MRC 500 

10 system connected to a Zeiss Axiovert compound microscope. Images were 

collected using the BHS and GHS filter sets, aligned using the ALIGN program, 
and merged using MERGE. Fluorescent bleed-through from the green into the 
red channel was reduced using the BLEED program (ail software provided by 
Bio-Rad). Photographs were obtained directly from the computer monitor using 

IS Kodak Ektar 125 film. 

6.1.6. CELLLYSATES, IMMUNOPRECIPITATIONS, 
AND WESTERN BLOTS 

Nondenaturing detergent lysates of tissue culture and wild-type 
2 q Canton-S embryos were prepared on ice in ~ 10 cell vol of lysis buffer (300 mM 
NaCl, 50 mM Tris (pH 8.0], 0.5% NP-40 ? 0.5% deoxycholate, 1 mM CaCl 2 , 1 
mM MgCl 2 ) with 1 mM phenylmethysulfonyl fluoride (PMSF) and diisopropyl 
fluorophosphate diluted 1:2500 as protease inhibitors. Lysates were sequentially 
triturated using 18G, 21G, and 25G needles attached to 1 cc tuberculin syringes 
25 and then centrifuged at full speed in a microfuge 10 min at 4°C to remove 
insoluble material. Immunoprecipitation was performed by adding ~ 1 fig of 
antibody (1-2 of polyclonal antiserum) to 250-500 /il of cell lysate and 
incubating for 1 hr at 4°C with agitation. To this mixture, 15 /*g of goat anti- 
mouse antibodies (Jackson lmmunoresearch; these antibodies recognize both 
30 mouse and rat IgG) were added and allowed to incubate for 1 hr at 4°C with 
agitation. This was followed by the addition of 100 fi\ of fixed Staphylococcus 
aureus (Staph A) bacteria (Zy sorbin, Zymed; resuspended according to 
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manufacturer's instructions), which had been collected, washed five times in lysis 
buffer, and incubated for another hour. Staph A-antibody complexes were then 
pelleted by centrifugation and washed three times in lysis buffer followed by two 
15 min washes in lysis buffer. After being transferred to a new tube, precipitated 

5 material was suspended in 50 p\ of SDS-PAGE sample buffer, boiled immediately 
for 10 min, run on 3%-15% gradient gels, blotted to nitrocellulose, and detected 
using monoclonal antibodies and HRP-conjugated goat anti-mouse secondary 
antibodies as previously described (Johansen et al., 1989, J. Cell Biol. 109, 2427- 
2440). For total cellular protein samples used on Western blots, cells were 

10 collected by centrifugation, lysed in 10 cell vol of sample buffer that contained 1 
mM PMSF, and boiled immediately. 



6.2. RESULTS 

6.2.1. THE EXPRESSION OF NOTCH AND 
DELTA IN CULTURED CELLS 

To detect interactions between Notch and Delta, we examined the 
behavior of cells expressing these proteins on their surfaces using an aggregation 
assay. We chose the S2 cell line (Schneider, 1972, J. Embryol. Exp. Morph. 27, 
353-365) for these studies. S2 cells express an aberrant Notch message and no 
detectable Notch due to a rearrangement of the 5' end of the Notch coding 
sequence. These cells also express no detectable Delta. 

Results of Western blot and immunofluorescent analysis clearly 
showed that the Notch and Delta constructs support expression of proteins of the 
expected sizes and subcellular localization. 

6.2.2. CF1 AS THAT EXPRESS NOT CH AND DELTA AGGREGATE 

A simple aggregation assay was used to detect interactions between 
Notch and Delta expressed on the surface of S2 cells. 

S2 cells in log phase growth were separately transfected with 
either the Notch or Delta metallothionein promoter construct. After induction 
with CuS0 4 , transfected ceils were mixed in equal numbers and allowed to 
aggregate overnight at room temperature (for details, see Experimental 
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Procedures, Section 6.1). Alternatively, in some experiments intended to reduce 
metabolic activity, cells were mixed gently at 4°C for 1-2 hr. To determine 
whether aggregates had formed, cells were processed for immunofluorescence 
microscopy using antibodies specific for each gene product and differently labeled 

5 fluorescent secondary antibodies. Expressing cells usually constituted less than 
5% of the total cell population because transient rather than stable transformants 
were used. The remaining cells either did not express a given protein or 
expressed at levels too low for detection by immunofluorescence microscopy. As 
controls, we performed aggregations with only a single type of transfected cell. 

10 The results (Fehon et al., 1990, Cell 61:523-534) showed that 

while Notch-expressing (Notch*) cells alone did not form aggregates in the assay, 
Delta-expressing (Delta + ) cells did. The tendency for Delta* cells to aggregate 
was apparent even in nonaggregated control samples, where cell clusters of 4-8 
cells that probably arose from adherence between mitotic sister cells commonly 

15 occurred. However, clusters were more common after incubation under 

aggregation conditions (e.g., 19% of Delta* cells in aggregates before incubation 
vs. 37% of Delta + cells in aggregates after incubation), indicating that Delta" 
cells are able to form stable contacts with one another in this assay. 

In remarkable contrast to control experiments with Notch + cells 

20 alone, aggregation of mixtures of Notch + and Delta* cells resulted in the 

formation of clusters of up to 20 or more cells. The fraction of expressing cells 
found in clusters of four or more stained cells after 24 hr of aggregation ranged 
from 32%-54% in mixtures of Notch* and Delta* cells. This range was similar 
to that seen for Delta* cells alone (37%-40%) but very different from that for 

25 Notch* cells alone (only 0%-5%). Although a few clusters that consisted only of 
Delta* cells were found, Notch* cells were never found in clusters of greater than 
four to five cells unless Delta* cells were also present. Again, all cells within 
these clusters expressed either Notch or Delta, even though transfected cells 
composed only a small fraction of the total cell population. At 48 hr, the degree 

30 of aggregation appeared higher (63%-71 %), suggesting that aggregation had not 
yet reached a maximum after 24 hr under these conditions. Also, cells 
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cotransfected with Notch and Delta constructs (so that all transfected cells express 
both proteins) aggregated in a similar fashion under the same experimental 
conditions. 

Notch involvement in the aggregation process was directly tested 
5 by examining the effect of a mixture of polyclonal antisera directed against fusion 
proteins that spanned almost the entire extracellular domain of Notch on 
aggregation (see Experimental Procedures, Section 6.1). To minimize artifacts 
that might arise due to a metabolic response to patching of surface antigens, 
antibody treatment and the aggregation assay were performed at 4°C in these 
10 experiments. Notch + cells were incubated with either preimmune or immune 

mouse sera for 1 hr, Delta* cells were added, and aggregation was performed for 
1-2 hr. While Notch* cells pretreated with preimmune sera aggregated with 
Delta* cells (in one of three experiments, 23% of the Notch 4 cells were in 
Notch*-Delta* cell aggregates), those treated with immune sera did not (only 2% 
15 of Notch* cells were in aggregates). This result suggested that the extracellular 
domain of Notch was required for Notch*-Delta* cell aggregation. 

6.2.3. NOTCH-DELTA-MEDIATED AGGREGATION IS 
CALCIUM DFPFNDFNT 

2Q The ability of expressing cells to aggregate in the presence or 

absence of Ca 2 * ions was tested to determine whether there is a Ca 2 * ion 
requirement for Notch-Delta aggregation. To minimize possible nonspecific 
effects due to metabolic responses to the removal of Ca 2 *, these experiments were 
performed at 4°C. The results clearly demonstrated a dependence of Notch- 

25 Delta-mediated aggregation on exogenous Ca 2 *. 

6.2.4. NOTCH AND DELTA INTERACT 
WITHIN A SINGLE CELL 

The question whether Notch and Delta are associated within the 

membrane of one cell that expresses both proteins was posed by examining the 

30 distributions of Notch and Delta in cotransfected cells. To test whether the 

observed colocalization was coincidental or represented a stable interaction 
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between Notch and Delta, live cells were treated with an excess of polyclonal 
anti-Notch antiserum. This treatment resulted in "patching" of Notch on the 
surface of expressing cells into discrete patches as detected by 
immunofluorescence. There was a distinct correlation between the distributions 
5 of Notch and Delta on the surfaces of these cells after this treatment, indicating 
that these proteins are associated within the membrane. 

6 2.5. INTERACTIONS WITH DELTA DO NOT REQUIRE 
THE INTRACELLULAR DOMAIN OF NOTCH 

In addition to a large extracellular domain that contains EGF-like 

repeats, Notch has a sizeable intracellular (IC) domain of -940 amino acids. 

The IC domain includes a phosphorylation site (Kidd et al., 1989, Genes Dev. 3, 

1113-1129), a putative nucleotide binding domain, a polyglutamine stretch 

(Wharton et al., 1985, Cell 43, 567-581; Kidd, et ah, 1986, Mol. Cell. Biol. 6, 

IS 3094-3108), and sequences homologous to the yeast cdclQ gene, which is 

involved in cell cycle control in yeast (Breeden and Nasmyth, 1987, Nature 329, 
651-654). A variant Notch construct was used from which coding sequences for 
-835 amino acids of the IC domain, including ail of the structural features noted 
above, had been deletbd (leaving 25 membrane-proximal amino acids and a novel 

9n 59 amino acid carboxyl terminus; see Experimental Procedures). 



consistently formed aggregates with Delta* cells, but not with themselves, just as 
was observed for cells that expressed intact Notch. Sharp bands of ECN1 
staining were observed within regions of contact with Delta* cells, again 
indicating a localization of ECN1 within regions of contact between cells. To test 
for interactions within the membrane, surface antigen co-patching experiments 
were conducted using cells cotransfected with the ECN1 and Delta constructs. As 
observed for intact Notch, when ECN1 was patched using polyclonal antisera 
against the extracellular domain of Notch, ECN1 and Delta colocalized at the cell 
surface. These results demonstrate that the observed interactions between Notch 
and Delta within the membrane do not require the deleted portion of the IC 
domain of Notch and are therefore probably mediated by the extracellular domain. 



In aggregation assays, cells that expressed the ECN1 construct 
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6.2.6. NOTCH AND DELTA FORM DETERGENT-SOLUBLE 
INTERMOLECULAR COMPLEXES 

The preceding results indicated molecular interactions between 

Notch and Delta present within the same membrane and between these proteins 

5 expressed on different cells. A further test for such interactions is whether these 
proteins would coprecipitate from nondenaturing detergent extracts of cells that 
express Notch and Delta. If Notch and Delta form a stable intermolecular 
complex either between or within cells, then it should be possible to precipitate 
both proteins from cell extracts using specific antisera directed against one of 

JO these proteins. This analysis was performed by immunoprecipitating Delta with 
polyclonal antisera from NP-40/deoxycholate lysates (see Experimental 
Procedures) of cells cotransfected with the Notch and Delta constructs that had 
been allowed to aggregate overnight or of 0-24 hr wild-type embryos. 

Coprecipitation of Notch was detected in Delta immunoprecipitates 

15 from cotransfected cells and embryos. However, coprecipitating Notch appeared 
to be present in much smaller quantities than Delta and was therefore difficult to 
detect. The fact that immunoprecipitation of Delta results in the coprecipitation 
of Notch constitutes direct evidence that these two proteins form stable 
intermolecular completes in transfected S2 cells and in embryonic cells. 

20 

6.3. DISCUSSION 
Use of an in vitro aggregation assay that employs normally 
nonadhesive S2 cells showed that cells that express Notch and Delta adhere 
specifically to one another. 

25 

7. EGF REPEATS 11 AND 12 OF NOTCH ARE 
REQUIRED AND SUFFICIENT FOR 
NOTCH-PELT A-MEDIATFD AGGREGATION 

The same aggregation assay was used as described in Section 6, 

together with deletion mutants of Notch to identify regions within the extracellular 

30 domain of Notch necessary for interactions with Delta. The evidence shows that 

the EGF repeats of Notch are directly involved in this interaction and that only 
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two (ELR 11 and 12) of the 36 EGF repeats appear necessary. These two EGF 
repeats are sufficient for binding to Delta and that the calcium dependence of 
Notch-Delta mediated aggregation also associates with these two repeats. Finally, 
the two corresponding EGF repeats from the Xenopus homolog of Notch also 
5 mediate aggregation with Delta, implying that not only has the structure of Notch 
been evolutionary conserved, but also its function. These results suggest that 
the extracellular domain of Notch is surprisingly modular, and could potentially 
bind a variety of proteins in addition to Delta. (See Rebay et al., 1991, Cell 
67:687-699.) 

10 

7.1. EXPERIMENTAL PROCEDURES 
7.1.1. EXPRESSION CONSTRUCTS 
The constructs described are all derivatives of the full length Notch 
expression construct #1 pMtNMg (see Section 6, supra), and were made as 
15 described (Rebay et al., 1991, Ceil 67:687-699). 

7.1.2. CELL CULTURE AND TRANSFECTION 
The Drosophila S2 cell line was grown and transfected as 
described in Section 6, supra. The Delta-expressing stably transformed S2 cell 
20 line L-49-6-7 (kindly established by L. Cherbas) was grown in M3 medium 
(prepared by Hazleton Co.) supplemented with \\% heat inactivated fetal calf 
serum (FCS) (Hyclone), 100 U/ml penicillin-100 /xg/ml streptomycin-0.25 iiglm\ 
fungizone (Hazleton), 2 x 10" 7 M methotrexate, 0.1 mM hypoxanthine, and 0.016 
mM thymidine. 

25 

7.1.3. AGGREGATION ASSAYS AND IMMUNOFLUORESCENCE 
Aggregation assays and Ca ++ dependence experiments were as 
described supra, Section 6. Cells were stained with the anti-Notch monoclonal 
antibody 9C6.C17 and anti-Delta rat polyclonal antisera (details described in 
30 Section 6, supra). Surface expression of Notch constructs in unpermeabilized cells 
was assayed using rat polyclonal antisera raised against the 0.8 kb (amino acids 
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237-501; Wharton et al., 1985, Cell 43, 567-581) BstYI fragment from the 
extracellular domain of Notch. Cells were viewed under epifluorescence on a 
Leitz Orthoplan 2 microscope. 

7.2. RESULTS 

7.2.1. EGF REPEATS 11 AND 12 OF 
NOTCH ARE REQUIRED FOR 
NOTCH-DELTA MEDIATED AGGREGATION 

An extensive deletion analysis was undertaken of the extracellular 
domain of the Notch protein, which was shown (supra, Section 6; Fehon et al., 
1990, Cell 61:523-534) to be involved in Notch-Delta interactions, to identify the 
precise domain of Notch mediating these interactions. The ability of cells 
transfected with the various deletion constructs to interact with Delta was tested 
using the aggregation assay described in Section 6. Briefly, Notch deletion 
constructs were transiently transfected into Drosophila S2 cells, induced with 
CuS0 4 , and then aggregated overnight at room temperature with a small amount 
of cells from the stably transformed Delta expressing cell line L49-6-7(Cherbas), 
yielding a population typically composed of - 1% Notch expressing cells and 
-5% Delta expressing cells, with the remaining cells expressing neither protein. 

Schematic drawings of the constructs tested and results of the 
aggregation experiments are shown in Figure 2. To assay the degree of 
aggregation, cells were stained with antisera specific to each gene product and 
examined with immunofluorescent microscopy. Aggregates were defined as 
clusters of four or more cells containing both Notch and Delta expressing cells, 
and the values shown in Figure 2 represent the percentage of all Notch expressing 
cells found in such clusters. All numbers reflect the average result from at least 
two separate transfection experiments in which at least 100 Notch expressing cell 
units (either single cells or clusters) were scored. 

The initial constructs (#2 DSph and #3 ACla) deleted large 
portions of the EGF repeats. Their inability to promote Notch-Delta aggregation 
suggested that the EGF repeats of Notch were involved in the interaction with 
Delta. A series of six in-frame Clal restriction sites was used to further dissect 
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the region between EGF repeats 7 and 30. Due to sequence homology between 
repeats, five of the Clal sites occur in the same relative place within the EGF 
repeat, just after the third cysteine, while the sixth site occurs just before the first 
cysteine of EGF repeat 31 (Figure 3). Thus, by performing a partial Clal 
5 digestion and then religating, deletions were obtained that not only preserved the 
open reading frame of the Notch protein but in addition frequently maintained the 
structural integrity and conserved spacing, at least theoretically, of the three 
disulfide bonds in the chimeric EGF repeats produced by the religation (Figure 2, 
constructs #4-14). Unfortunately, the most 3' Clal site was resistant to digestion 
10 while the next most 3' Clal site broke between EGF repeats 30 and 31 . 
Therefore, when various Clal digestion fragments were reinserted into the 
framework of the complete Clal digest (construct #3 ACIa), the overall structure 
of the EGF repeats was apparently interrupted at the 3' junction. 

Several points about this series of constructs are worth noting. 
15 First, removal of the Clal restriction fragment breaking in EGF repeats 9 and 17 
(construct #8 AEGF9-17) abolished aggregation with Delta, while reinsertion of 
this piece into construct 03 ACIa, which lacks EGF repeats 7-30, restored 
aggregation to roughly wild type levels (construct #13 ACla+EGF9-17), 
suggesting that EGF repeats 9 through 17 contain sequences important for binding 
20 . Delta. Second, all constructs in this series (#4-14) were consistent with the 
t .ding site mapping to EGF repeats 9 through 17. Expression constructs 
containing these repeats (#6, 7, 9, 10, 13) promoted Notch-Delta interactions 
while constructs lacking these repeats (#4, 5, 8. 11. 12, 14) did not. To confirm 
that inability to aggregate with Delta cells was not simply due to failure of the 
25 mutagenized Notch protein to reach the cell surface, but actually reflected the 

deletion of the necessary binding site, cell surface expression of all constructs was 
tested by immunofluorescently staining live transfected cells with antibodies 
specific to the extracellular domain of Notch. All constructs failing to mediate 
Notch-Delta interactions produced a protein that appeared to be expressed 
30 normally at the cell surface. Third, although the aggregation assay is not 

quantitative, two constructs which contained EGF repeats 9-17, #9 AEGF17-26 or 
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most noticeably #10 AEGF26-30. aggregated at a seemingly lower level. Cells 
transfected with constructs #9 AEGF17-26 and 10 AEGF26-30 showed 
considerably less surface staining than normal, although fixed and permeabilized 
cells reacted with the same antibody stained normally, indicating the epitopes 
5 recognized by the antisera had not been simply deleted. By comparing the 

percentage of transfected cells in either permeabilized or live cell populations, it 
was found that roughly 50% of transfected cells for construct #9 AEGF17-26 and 
10% for construct #10 AEGF26-30 produced detectable protein at the cell 
surface. Thus these two constructs produced proteins which often failed to reach 
10 the cell surface, perhaps because of misfolding, thereby reducing, but not 

abolishing, the ability of transfected cells to aggregate with Delta-expressing cells. 

Having mapped the binding site to EGF repeats 9 through 17, 
further experiments (Rebay et al., 1991, Cell 67:687-699) revealed that EGF 
repeat 14 of Notch was not involved in the interactions with Delta modelled by 
15 the tissue culture assay. 

To further map the Delta binding domain within EGF repeats 9- 
17, specific oligonucleotide primers and the PCR technique were used to generate 
several subfragments of this region. Three overlapping constructs, #16, 17 and 
18 were produced, only one of which, #16 ACIa+EGF9-13, when transfected 
20 into S2 cells, allowed aggregation with Delta cells. Construct #19 

ACla+EGF(10-13), which lacks EGF repeat 9, further defined EGF repeats 10- 
13 as the region necessary for Notch-Delta interactions. 

Constructs #20-24 represented attempts to break this domain down 
even further using the same PCR strategy (see Figure 3). Constructs #20 
25 ACla+EGF(l 1-13), in which EGF repeat 12 is the only entire repeat added, and 
#21 ACla+EGF(10-12), in which EGF repeat 11 is the only entire repeat added, 
failed to mediate aggregation, suggesting that the presence of either EGF repeat 
11 or 12 alone was not sufficient for Notch-Delta interactions. However, since 
the 3' ligation juncture of these constructs interrupted the overall structure of the 
30 EGF repeats, it was possible that a short "buffer" zone was needed to allow the 
crucial repeat to function normally. Thus for example in construct #19 
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ACla+EGF(10-13), EGF repeat 12 might not be directly involved in binding to 
Delta but instead might contribute the minimum amount of buffer sequence 
needed to protect the structure of EGF repeat 11, thereby allowing interactions 
with Delta. Constructs #22-24 addressed this issue. Constructs #22 
5 ACla+EGF(10-ll), which did not mediate aggregation, and #23 ACIa+EGF(10- 

12), which did, again suggested that both repeats 11 and 12 are required while the 
flanking sequence from repeat 13 clearly is not. Finally, construct #24 
ACla+EGF(ll-12), although now potentially structurally disrupted at the 5' 
junction, convincingly demonstrated that the sequences from EGF repeat 10 are 
10 not crucial. Thus based on entirely consistent data from 24 constructs, EGF 

repeats 11 and 12 of Notch together define the smallest functional unit obtainable 

from this analysis that contains the necessary sites for binding to Delta in 

transfected S2 cells. 



15 7 2.2. EGF REPEATS 1 1 AND 12 OF NOTCH 

ARE SUFFICIENT FOR NOTCH-DELTA 
MEDIATED AGGREGATION 

The large Clal deletion into which PCR fragments were inserted 
(#3 ACIa) retains roughly 1/3 of the original 36 EGF repeats as well as the three 
Notch /lin-12 repeats. While these are clearly not sufficient to promote 
20 aggregation, it is possible that they form a necessary framework within which 
specific EGF repeats can interact with Delta. To test whether only a few EGF 
repeats were in fact sufficient to promote aggregation, two constructs were 
designed, #25 AEGF which deleted all 36 EGF repeats except for the first two- 
thirds of repeat 1, and #30 AECN which deleted the entire extracellular portion of 

25 Notch except for the first third of EGF repeat 1 and - 35 amino acids just before 
the transmembrane domain. Fragments which had mediated Notch-Delta 
aggregation in the background of construct #3 ACIa, when inserted into construct 
#25 AEGF, were again able to promote interactions with Delta (constructs #26- 
30). Analogous constructs (#31,32) in which the Notch/lin-12 repeats were also 

30 absent, again successfully mediated Notch-Delta aggregation. Thus EGF repeats 
11 and 12 appear to function as independent modular units which are sufficient to 
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mediate Notch-Delta interactions in S2 cells, even in the absence of most of the 
extracellular domain of Notch. 

7 2.3. EGF REPEATS 11 AND 12 OF NOTCH 

MAINTAIN THE CALCIUM DEPENDENCE OF 
5 NOTCH-DELTA MEDIATED AGGREGATION 

The ability of cells expressing certain deletion constructs to 
aggregate with Delta expressing cells was examined in the presence or absence of 
Ca ++ ions. The calcium dependence of the interaction was preserved in even the 
10 smallest construct, consistent with the notion that the minimal constructs 

containing EGF repeats 11 and 12 bind to Delta in a manner similar to that of full 
length Notch. 

7.2.4. THE DELTA BINDING FUNCTION OF EGF 
REPEATS 11 AND 12 OF NOTCH IS 
15 CONSERVED IN THE XENOPUS 

HOMOLOG OF NOTCH 

PCR primers based on the Xenopus Notch sequence (Coffman et 

al., 1990, Science 249, 1438-1441) were used to obtain an -350 bp fragment 

from a Xenopus Stage 17 cDNA library that includes EGF repeats 11 and 12 

20 flanked by half of repeats 10 and 13 on either side. This fragment was cloned 
into construct #3 ACIa, and three independent clones were tested for ability to 
interact with Delta in the cell culture aggregation assay. Two of the clones, 
#33a&bACla+XEGF(10-13), when transfected into S2 cells were able to mediate 
Notch-Delta interactions at a level roughly equivalent to the analogous Drosophila 

25 Notch construct #19ACIa+EGF(10-13), and again in a calcium dependent manner 
(Table III). However, the third clone #33cACIa+XEGF(10-13) failed to mediate 
Notch-Delta interactions although the protein was expressed normally at the cell 
surface as judged by staining live unpermeabilized cells. Sequence comparison of 
the Xenopus PCR product in constructs #33a and 33c revealed a missense . 

30 mutation resulting in a leucine to proline change (amino acid #453, Coffman, et 
al., 1990, Science 249, 1438-1441) in EGF repeat 11 of construct #33c. 
Although this residue is not conserved between Drosophila and Xenopus Notch 
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(Figure 8), the introduction of a proline residue might easily disrupt the structure 
of the EGF repeat, and thus prevent it from interacting properly with Delta. 

Comparison of the amino acid sequence of EGF repeats 11 and 12 
of Drosophila and Xenopus Notch reveals a high degree of amino acid identity, 

5 including the calcium binding consensus sequence (Figure 4, SEQ ID NO: 1 and 
NO:2). However the level of homology is not strikingly different from that 
shared between most of the other EGF repeats, which overall exhibit about 50% 
identity at the amino acid level. This one to one correspondence between the 
individual EGF repeats of Drosophila and Xenopus Notch, together with the 

10 functional conservation of ELR 1 1 and 12, suggests that the 36 EGF repeats of 
Notch comprise a tandem area of conserved functional units. 

7.3. DISCUSSION 
An extensive deletion analysis of the extracellular domain of Notch 
15 was used to show that the regions >r Notch containing EGF-homologous repeats 
11 and 12 are both necessary and sufficient for Notch-Delta-mediated 
aggregation, and that this Delta binding capability has been conserved in the same 
two EGF repeats of Xenopus Notch. The finding that the aggregation mapped to 
EGF repeats 11 and 12 of Notch demonstrates that the EGF repeats of Notch also 
20 function as specific protein binding domains. EGF repeats 1 1 and 12 alone 
(#32AECN+EGF(11-12)) were sufficient to maintain the Ca ++ dependence of 

Notch-Delta interactions. 

The various deletion constructs suggest that ELR 11 and ELR 12 
function as a modular unit, independent of the immediate context into which they 

25 are placed. Thus, neither the remaining 34 EGF repeats nor the three Metch/iin- 
12 repeats appear necessary to establish a structural framework required for EGF 
repeats 11 and 12 to function. Interestingly, almost the opposite effect was 
observed: although the aggregation assay does not measure the strength of the 
interaction, as the binding site was narrowed down to smaller and smaller 

30 fragments, an increase was observed in the ability of the transfected cells to 
aggregate with Delta expressing cells, suggesting that the normal flanking EGF 
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sequences actually impede association between the proteins. The remaining 34 
EGF repeats may also form modular binding domains for other proteins 
interacting with Notch at various times during development. 

The finding that EGF repeats 11 and 12 of Notch form a discrete 
5 Delta binding unit represents the first concrete evidence supporting the idea that 
each EGF repeat or small subset of repeats may play a unique role during 
development, possibly through direct interactions with other proteins. The 
homologies seen between the adhesive domain of Delta and Serrate (Figure 5) 
suggest that the homologous portion of Serrate is "adhesive" in that it mediates 
10 binding to other toporythmic proteins (see Section 8, infra)- m addition, the gene 
scabrous, which encodes a secreted protein with similarity to fibrinogen, may 

interact with Notch. 

In addition to the EGF repeat, multiple copies of other structural 
motifs commonly occur in a variety of proteins. One relevant example is the 
15 cdclO/ankyrin motif, six copies of which are found in the intracellular domain of 
Notch. Ankyrin contains 22 of these repeats. Perhaps repeated arrays of 
structural motifs may in general represent a linear assembly of a series of 
modular protein binding units. Given these results together with the known 
structural, genetic and developmental complexity of Notch, Notch may interact 
20 with a number of different ligands in a precisely regulated temporal and spacial 
pattern throughout development. Such context specific interactions with 
extracellular proteins could be mediated by the EGF and Mstch/iin-12 repeats, 
while interactions with cytoskeletal and cytoplasmic proteins could be mediated by 
the intracellular cdclO/ankyrin motifs. 



25 



8 SEQUENCES WHICH MEDIATE 
NlfYTfH-SERR A TF 1NTFR ACTIONS 
As described herein, the two EGF repeats of Notch which mediate 
interactions with Delta, namely EGF repeats 11 and 12, also constitute a Serrate 
30 binding domain (see Rebay et al. , 1991, Cell 67:687-699). 
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To test whether Notch and Serrate directly interact, S2 cells were 
transfected with a Serrate expression construct and mixed with Notch expressing 
cells in an aggregation assay. For the Serrate expression construct, a syntheuc 
primer containing an artificial BamHI site immediately 5' to the initiator AUG at 
5 position 442 (all sequence numbers are according to Fleming eld.. 1990, Genes 
& Dev 4-2188-2201) and homologous through position 464, was used in 
conjunction with a second primer from position 681-698 to generate a DNA 
fragment of -260 base pairs. This fragment was cut with BamHI and Kpnl 
(position 571) and ligated into Bluescript KS+ (Stratagene). This construct, 
10 BTSerS'PCR, was checked by sequencing, then cut with Kpnl. The Serrate Kpnl 
fragment (571 - 2981) was inserted and the proper orientation selected, to 
generate BTSer5 'PCR-Kpn. The 5' SacII fragment of BTSer5 'PCR-Kpn (Sacll 
sites in Bluescript polylinker and in Serrate (1199)) was isolated and used to 
replace the 5' SacII fragment of cDNA CI (Fleming et al., 1990, Genes & Dev. 
15 4:2188-2201), thus regenerating the full length Serrate cDNA minus the 5' 
untranslated regions. This insert was isolated by a Sail and partial BamHI 
digestion and shuttled into the BamHI and Sail sites of pRmHa-3 to generate the 
final expression construct, Ser-mtn. 

Serrate expressing cells adhered to Notch expressing cells in a 
20 calcium dependent manner (Figure 2 and Rebay et al. , 1991. supra). However, 
unlike Delta, under the experimental conditions tested. Serrate did not appear to 
interact homotypically. In addition, no interactions were detected between Serrate 
and Delta. 

A subset of Notch deletion constructs were tested, and showed that 
25 EGF repeats 1 1 and 12, in addition to binding to Delta, also mediate interactions 
with Serrate (Figure 2; Constructs #1. 7-10, 13, 16, 17, 19. 28, and 32). In 
addition, the Serrate-binding function of these repeats also appears to have been 
conserved in the corresponding two EGF repeats of Xenopus Notch 
(#33ACla+XEGF(10-13)). These results unambiguously show that Notch 
30 interacts with both Delta and Serrate, and that the same two EGF repeats of 
Notch mediate both interactions. The Serrate region which is essential for the 
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Notch/Serrate aggregation was also defined. Deleting nucleotides 676-1287 (i.e. 
amino acids 79-282) (See Figure 5; SEQ ID NO:3 and NO:4) eliminates the 
ability of the Serrate protein to aggregate with Notch. 

Notch and Serrate appear to aggregate less efficiently than Notch 

5 and Delta, perhaps because the Notch-Serrate interaction is weaker. One trivial 
explanation for this reduced amount of aggregation could be that the Serrate 
construct simply did not express as much protein at the cell surface as the Delta 
construct, thereby diminishing the strength of the interaction. Alternatively, the 
difference in strength of interaction may indicate a fundamental functional 

10 difference between Notch-Delta and Notch-Serrate interactions that may be 
significant in vivo. 

9 THE CLONING, SEQUENCING, AND 
REPRESSION ™= HUMAN NOTCH 

9.1. ISOLATION ANT) SEQUENCI NG OF HUMAN NOTCH 

Clones for the human Notch sequence were originally obtained 

using the polymerase chain reaction (PCR) to amplify DNA from a 17-18 week 

human fetal brain cDNA library in the Lambda Zap II vector (Stratagene). 

The 40C^bp fragment obtained in this manner was then used as a 

probe with which to screen the same library for human Notch clones. The 

original screen yielded three unique clones. hN3k, hN2K, and hN5k, all of which 

were shown by subsequent sequence analysis to fall in the 3' end of human Notch 

(Figure 6). A second screen using the 5' end of hN3k as probe was undertaken 

to search for clones encompassing the 5' end of human Notch- One unique clone, 

hN4k, was obtained from this screen, and preliminary sequencing data indicate 

that it contains most of the 5' end of the gene (Figure 6). Together, clones 

hN4k, hN3k and hN5k encompass about 10 kb of the human Notch homolog(s), 

beginning early in the EGF-repeats and extending into the 3' untranslated region 

of the gene. All three clones are cDNA inserts in the EcoRI site of pBluescript 

SK" (Stratagene). The host E. goH strain is XLl-Blue (see Maniatis, T., 1990, 

Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, New York, p. A12). An alignment of the 
human Notch sequences with Drosophila Notch is shown in Figure 7. 

The sequence of various portions of Notch contained in the cDNA 
clones was determined (by use of Sequenase®, U.S. Biochemical Corp.) and is 
5 shown for hN2k and hN4k in Figures 8 (SEQ ID NO:5-7) and 9 (SEQ ID NO:8, 
9), respectively. Further sequence analysis of hN2k revealed that it encodes a 
human Notch sequence overlapping that contained in hN5k. 

The complete nucleotide sequences of the human Notch cDNA 
contained in hN3k and hN5k was determined by the dideoxy chain termination 
10 method using the Sequenase® kit (U.S. Biochemical Corp.). Those nucleotide 
sequences encoding human Notch, in the appropriate reading frame, were readily 
identified since there are no introns and translation in only one out of the three 
possible reading frames yields a sequence which, upon comparison with the 
published Drosophila Notch deduced amino acid sequence, yields a sequence with 
15 a substantial degree of homology to the Drosophila Notch sequence. The DNA 
and deduced protein sequences of the human Notch cDNA in hN3k and hN5k are 
presented in Figures 10 (SEQ ID NO:10. 11) and 11 (SEQ ID NO:12, 13), 
respectively. Clone hN3k encodes a portion of a Notch polypeptide starting at 
approximately the third Notch /lin-12 repeat to several amino acids short of the 
20 carboxy-terminal amino acid. Clone hN5k encodes a portion of a Notch 

polypeptide starting approximately before the cdclO region through the end of the 
polypeptide, and also contains a 3' untranslated region. 

Comparing the DNA and protein sequences presented in Figure 10 
(SEQ ID NO: 10, 11) with those in Figure 11 (SEQ ID NO: 12, 13) reveals 
25 significant differences between the sequences, suggesting that hN3k and hN5k 
represent part of two distinct Notch -homologous genes. The data thus suggest 
that the human genome harbors more than one Notch-homologous gene. This is 
unlike Drosophila, where Notch appears to be a single-copy gene. 

Comparison of the DNA and amino acid sequences of the human 
30 Not£h homologs contained in hN3k and hN5k with the corresponding Drosophila 
Notch sequences (as published in Wharton et al., 1985, Cell 43:567-581) and 
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with the corresponding Xenopus Notch sequences (as published in Coffman et al., 
1990, Science 249:1438-1441 or available from Genbank® (accession number 
M33874)) also revealed differences. 



5 compared with the predicted sequence of the TAN-1 polypeptide shown in Figure 
2 of Ellisen et al., August 1991, Cell 66:649-661. Some differences were found 
between the deduced amino acid sequences; however, overall the hN3k Notch 
polypeptide sequence is 99% identical to the corresponding TAN-1 region (TAN- 
1 amino acids 1455 to 2506). Four differences were noted: in the region 

10 between the third Notch /lin-12 repeat and the first cdclO motif, there is an 

arginine (hN3k) instead of an X (TAN-1 amino acid 1763); (2) there is a proline 
(hN3k) instead of an X (TAN-1, amino acid 1787); (3) there is a conservative 
change of an aspartic acid residue (hN3k) instead of a glutamic acid residue 
(TAN-1, amino acid 2495); and (4) the carboxyl-terminal region differs 

15 substantially between TAN-1 amino acids 2507 and 2535. 



compared with the predicted sequence of the TAN-1 polypeptide shown in Figure 
2 of Ellisen et aL, August 1991, Cell 66:649-661. Differences were found 
between the deduced amino acid sequences. The deduced Notch polypeptide of 

20 hN5k is 79% identical to the TAN-1 polypeptide (64% identical to Drosophila 
Notch) in the cdclO region that encompasses both the cclO motif (TAN-1 amino 
acids 1860 to 2217) and the well-conserved flanking regions (Fig. 12). The 
cdclO region covers amino acids 1860 through 2217 of the TAN-1 sequence. In 
addition, the hN5k encoded polypeptide is 65% identical to the TAN-1 

25 polypeptide (44% identical to Drosophila Notch) at the carboxy-terminal end of 
the molecule containing a PEST (proline, glutamic acid, serine, threonine)-rich 
region (TAN-1 amino acids 2482 to 2551) (Fig. 12B). The stretch of 215 amino 
acids lying between the aforementioned regions is not well conserved among any 
of the Notch -homologous clones represented by hN3k, hN5k, and TAN-1. 

30 Neither the hN5k polypeptide nor Drosophila Notch shows significant levels of 
amino acid identity to the other proteins in this region {e.g., hN5k/TAN-l = 



The amino acid sequence shown in Figure 10 (hN3k) was 



The amino acid sequence shown in Figure 11 (hN5k) was 
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24% identity; MiSUDrosophila Notch = 11% identity; TMi-MDrosophila Notch 
= 17% identity). In contrast, Xenopus Notch (Xotch) (SEQ ID NO: 16), rat 
Notch (SEQ ID NO: 17), and TAN-1 (SEQ ID NO: 18) continue to share 
significant levels of sequence identity with one another (e.g., TAN-1 /rat Notch = 
5 75% identity, IMi-MXenopus Notch = 45% identity, rat Noich/Xenopus Notch 

= 50% identity). 

Examination of the sequence of the intracellular domains of the 
vertebrate Notch homologs shown in Figure 12B revealed an unexpected finding: 
all of these proteins, including hN5k, contain a putative CcN motif, associated 

10 with nuclear targeting function, in the conserved region following the last of the 
six cdclO repeats (Fig. 12B). Although Drosophila Notch lacks such a defined 
motif, closer inspection of its sequence revealed the presence of a possible 
bipartite nuclear localization sequence (Robbins et al., 1991, Cell 64:615-623), as 
well as of possible CK II and cdc2 phosphorylation sites, all in relative proximity 

15 to one another, thus possibly defining an alternative type of CcN motif (Fig. 
12B). 

To isolate clones covering the 5' end of hN (the human Notch 
homolog contained in part in hN5k), clone hN2k was used as a probe to screen 
260,000 plaques of human fetal brain phage library, commercially available from 

20 Stratagene, for crosshybridizing clones. Four clones were identified and isolated 
using standard procedures (Maniatis et al., 1982, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York). Four clones were also isolated by hybridization to the Notch-homologous 
sequence of Adams et al., 1992, Nature 355:632-655. which was obtained from 

25 the ATCC. 

To isolate clones covering the 5' end of TAN-1, the human fetal 
brain library that is commercially available from Stratagene was screened for 
clones which would extend the sequence to the 5' end. 880,000 plaques were 
screened and four clones were identified which crosshybridized with the hN3k 
30 sequences. Sequencing confirmed the relative position of these sequences within 
the Notch protein encoded by TAN-1. 
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The 5' sequence of our isolated TAN-1 homolog has been 
determined through nucleotide number 972 (nucleotide number 1 being the A in 
the ATG initiation codon), and compared to the sequence as published by Ellisen 
efal (1991, Cell 66:649-661). At nucleotide 559, our TAN-1 homolog has a G, 
5 whereas Ellisen et a), disclose an A, which change results in a different encoded 
amino acid. Thus, within the first 324 amino acids, our TAN-l-encoded protein 
differs from that taught by Ellisen et al., since our protein has a Gly at position 
187, whereas Ellisen et al. disclose an Arg at that position (as presented in Figure 
13.) 

10 The full-length amino acid sequences of both the hN 

(SEQ ID NO: 19) and TAN-l-encoded (SEQ ID NO:20) proteins, as well as 
Xenopus and Drosophila Notch proteins, are shown in Figure 13. The full-length 
DNA coding sequence (except for that encoding the initiator Met) (contained in 
SEQ ID NO:21) and encoded amino acid sequence (except that the initiator Met is 

15 not shown) (contained in SEQ ID NO: 19) of hN are shown in Figure 17. 

9.2. EXPRESSION OF HUMAN NOTCH 
Expression constructs were made using the human Notch cDNA 
clones discussed in Section 9.1 above. In the cases of hN3k and hN2k, the entire 

20 clone was excised from its vector as an EcoRI restriction fragment and subcloned 
into the EcoRI restriction site of each of the three pGEX vectors (Glutathione S- 
Transferase expression vectors; Smith and Johnson, 1988, Gene 7, 31-40). This 
allows for the expression of the Notch protein product from the subclone in the 
correct reading frame. In the case of hN5k, the clone contains two internal 

25 EcoRI restriction sites, producing 2.6, 1.5 and 0.6 kb fragments. Both the 2.6 
and the 1.5 kb fragments have also been subcloned into each of the pGEX 
vectors. 

The pGEX vector system was used to obtain expression of human 
Notch fusion (chimeric) proteins from the constructs described below. The 
30 cloned Notch DNA in each case was inserted, in phase, into the appropriate 

pGEX vector. Each construct was then electroporated into bacteria (E. coh), and 
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was expressed as a fusion protein containing the Notch protein sequences fused to 
the carboxyl terminus of glutathione S-transferase protein. Expression of the 
fusion proteins was confirmed by analysis of bacterial protein extracts by 
polyacryiamide gel electrophoresis, comparing protein extracts obtained from 
5 bacteria containing the pGEX plasmids with and without the inserted Notch DNA. 
The fusion proteins were soluble in aqueous solution, and were purified from 
bacterial lysates by affinity chromatography using glutathione-coated agarose 
(since the carboxyl terminus of glutathione S-transferase binds to glutathionine). 
The expressed fusion proteins were bound by an antibody to Drosophila Notch, as 
10 assayed by Western blotting. 

The constructs used to make human Notch-glutathione S- 
transferase fusion proteins were as follows: 

hNFP#2 - PCR was used to obtain a fragment starting just before 
the cdclO repeats at nucleotide 192 of the hN5k insert to just before the 
15 PEST-rich region at nucleotide 1694. The DNA was then digested with 

BamHl and Smal and the resulting fragment was ligated into pGEX-3. 
After expression, the fusion protein was purified by binding to glutathione 
agarose. The purified polypeptide was quantitated on a 4-15% gradient 
polyacryiamide gel. The resulting fusion protein had an approximate 
20 molecular weight of 83 kD. 

hN3FPfll - The entire hN3k DNA insert (nucleotide 1 to 3235) 
was excised from the Bluescript (SK) vector by digesting with EcoRI. 
The DNA was ligated into pGEX-3. 

hN3FP#2 - A 3' segment of hN3k DNA (nucleotide 1847 to 3235) 
25 plus some of the polylinker was cut out of the Bluescript (SK) vector by 

digesting with Xmal. The fragment was ligated into pGEX-1. 

Following purification, these fusion proteins are used to make 
either polyclonal and/or monoclonal antibodies to human Notch. 

30 
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10. NOTCH EXPRESSION IN NORMAL 
AND MALIGNANT CELLS 

Various human patient tissue samples and cell lines, representing 
both normal and a wide variety of malignant cells are assayed to detect and/or 
quantitate expression of Notch. Patient tissue samples are obtained from the 
pathology department at the Yale University School of Medicine. 

The following assays are used to measure Notch expression in 
patient tissue samples: (a) Northern hybridization; (b) Western blots; (c) in situ 
hybridization; and (d) immunocytochemistry. Assays are carried out using 
standard techniques. Northern hybridization and in situ hybridization are carried 
out (i) using a DNA probe specific to the Notch sequence of clone hN3k; and (ii) 
using a DNA probe specific to the Notch sequence of clone hN5k. Western blots 
and immunocytochemistry are carried out using an antibody to Drosophila Notch 
protein (which also recognizes human Notch proteins). 

Northern hybridization and Western blots, as described above, are 
also used to analyze numerous human cell lines, representing various normal or 
cancerous tissues. The cell lines tested are listed in Table 2. 



20 



25 



30 



Tissue/Tumor 
Bone marrow 

Brain 



Table 2 
HUMAN CELL LINES 
Cell line 



Breast 



IM-9 
KG-1 

A-172 
HS 683 
U-87MG 
TE 671 

BT-20 

Hs 578Bs 

MDA-MB-330 
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Colon 



10 



15 



20 



25 



Embryo 
Kidney 

Leukemia 

Liver 

Lung 

Lymphoblasts 
Lymphoma 

Melanoma 



Myeloma 



Neuroblastoma 



30 



Ovary 



Plasma Cells 



Caco-2 
SW 48 
T84 
WiDr 

FHs 173 We 

A-498 
A-704 
Caki-2 

ARH-77 
KG-1 

Hep G2 
WRL 68 

Calu-1 
HLF-a 
SK-Lu-1 

CCRF-CEM 
HuT78 

Hs445 
MS116 
U-937 

A-375 
G-361 
Hs 294T 
SK-MEL-1 

1M-9 

RPMI 8226 

1MR-32 

SK-N-SH 

SK-N-MC 

Caov-3 
Caov-4 
PA-1 

ARH-77 



35 



WO 94/07474 




PCI7US93/09338 



10 



Sarcoma A-204 

A673 
HOS 

Skin Amdur II 

BUD-8 

Testis Tera-1 

Tera-2 

Thymus Hs67 

Uterus AN3 Ca 

HEC-l-A 



Malignancies of malignant cell tissue types which are thus shown 
to specifically express Notch can be treated as described in Section 5.1 ex seq. 

15 10.1. EXPRESSION OF HUMAN NOTCH PROTEIN 

IS INCREASED IN VARIOUS MALIGNANCIES 

As described below, we have found that human Notch protein 

expression is increased in at least three human cancers, namely cervical, breast, 

and colon cancer. Immunocytochemical staining of tissue samples from cervical, 

20 breast, and colon cancers of human patients showed clearly that the malignant 

tissue expresses high levels of Notch, at increased levels relative to non-malignant 
tissue sections. This broad spectrum of different neoplasias in which there is 
elevated Notch expression suggests that many more cancerous conditions will be 
seen to upregulate Notch. 

25 Slides of human tumor samples (for breast, colon, and cervical 

tumors) were obtained from the tissue bank of the Pathology Department, Yale 
Medical School. The stainings were done using monoclonal antibodies raised 
against the PI and P4 fusion proteins which were generated from sequences of hN 
and TAN-1, respectively. 

30 The PI and P4 fusion proteins were obtained by insertion of the 

desired human Notch sequence into the appropriate pGEX expression vector 
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(Smith and Johnson, 1988, Gene 7:31-40; AMRAD Corp., Melbourne, Australia) 
and were affinity-purified according to the instructions of the manufacturer 
(AMRAD Corp.). For production of the PI fusion protein, pGEX-2 was cut with 
BamHI and ligated to a concatamer which consists of three copies of a 518 bp 

5 BamHI-BgHI fragment of hN. Rats were immunized with the expressed protein 
and monoclonal antibodies were produced by standard procedures. For 
production of the P4 fusion protein, pGEX-2 was cut with BamHI and ligated to a 
concatamer which consists of three copies of a 473 bp BamHI-BgHI fragment of 
TAN-1. Rats were immunized with the expressed protein, and monoclonal 

10 antibodies were produced by standard procedures. 

In all tumors examined, the Notch proteins encoded by both 
human Notch homologs TAN-1 and hN were present at increased levels in the 
malignant part of the tissue compared to the normal part. Representative 
stainings are shown in the pictures provided (Figs. 14-16). 

15 The staining procedure was as follows: The tissues were fixed in 

paraformaldehyde, embedded in paraffin, cut in 5 micrometer thick sections and 
placed on glass slides. Then the following steps were carried out: 

1. Deparafinization through 4 changes of xylene, 4 minutes each. 

2. Removal of xylene through 3 changes in absolute ethanol, 4 
20 minutes each. 

3. Gradual rehydration of the tissues by immersing the slides into 
95%, 90%, 80%, 60% and 30% ethanol, 4 minutes each. At the 
end the slides were rinsed in distilled water for 5 minutes. 

4. Quenching of endogenous, peroxidase by incubating for 30 
25 minutes in 0.3% hydrogen peroxide in methanol. 

5. Washing in PBS (10 mM sodium phosphate pH 7.5, 0.9% NaCl) 
for 20 minutes. 

6. Incubation for 1 hour in blocking solution. (Blocking solution: 
PBS containing 4% normal rabbit serum and 0.1 Triton X-100.) 

30 
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11. DEPOSIT OF MICROORGANISMS 
The following recombinant bacteria, each carrying a plasmid 
encoding a portion of human Notch, were deposited on May 2, 1991 with the 
American Type Culture Collection, 1201 Parklawn Drive, Rockville, Maryland 
5 20852, under the provisions of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedures. 



Bacteria carrying 
10 E. coll XLl-Blue 
E. coH XLl-Blue 
E. coll XLl-Blue 



Plasmid ATCC Accession No. 
hN4k 68610 
hN3k 68609 
hN5k 68611 



The present invention is not to be limited in scope by the 
15 microorganisms deposited or the specific embodiments described herein. Indeed, 
various modifications of the invention in addition to those described herein will 
become apparent to those skilled in the art from the foregoing description and 
accompanying figures. Such modifications are intended to fall within the scope of 
the appended claims. 

20 Various publications are cited herein, the disclosures of which are 

incorporated by reference in their entireties. 
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7. Incubation overnight at 4°C with primary antibody diluted in 
blocking solution. Final concentration of primary antibody 20-50 
Kg/ml. 

8. Washing for 20 minutes with PBS +0. 1 % Triton X-100 (3 

5 changes). 

9. Incubation for 30 minutes with biotinylated rabbit anti-rat 
antibody: 50 ^ of biotinylated antibody (VECTOR) in 10 ml of 
blocking solution. 

10. Washing for 20 minutes with PBS +0.1 % Triton X-100 (3 

10 changes). 

1 1 . Incubation with ABC reagent (VECTOR) for 30 minutes (the 
reagent is made in PBS +0.1% Triton X-100). 

12. Washing for 20 minutes in PBS+0. 1 % Triton X-100. Followed 
by incubation for 2 minutes in PBS +0.5% Triton X-100. 

15 13. Incubation for 2-5 minutes in peroxidase substrate solution. 

Peroxidase substrate solution: Equal volumes of 0.02% hydrogen 
peroxide in distilled water and 0. 1 % diaminobenzidine 
tetrahydrochloride (DAB) in 0.1 M Tris buffer pH 7.5 are mixed 
just before the incubation with the tissues. Triton X-100 is added 

20 to the final solution at a concentration of 0.5%. 

14. Washing for 15 minutes in tap water. 

15. Counterstaining for 10 minutes with Mayer's hematoxylin. 

16. Washing for 15 minutes in tap water. 

17. Dehydration through changes in 30%, 60%, 80%, 90%, 95% and 
25 absolute ethanol (4 minutes each). 

18. Immersion into xylene (2 changes, 4 minutes each). 

19. Mounting, light microscopy. 
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(A) LENGTH : 2892 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTAG TTGCCGCACA 60 

CACACACACA CACAGCCCGT GGATTATTAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 

GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 171 
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TTC ATT 
Phe lie 



GAG TTG 
Glu Leu 



GGT CGC 
Gly Arg 



GGC AGC 
Gly Ser 
60 

ACC ATC 
Thr He 
75 

ATT CTC 
He Leu 



AAC AAG 
Asn Lys 



TGC TTC ACA 
Cys Phe Thr 
15 

CGC CTG AAG 
Arg Leu Lys 
30 

TGC TGC AGC 
Cys Cys Ser 
45 

TGC AAG ACG 
Cys Lys Thr 



Met 
1 

GTC ATC 
Val He 



-90- 
His Trp He 



TAC TTC 
Tyr Phe 



GGG GAG 
Gly Glu 



GAC ACC ACC 
Asp Thr Thr 



GGC GAG AAC 
Gly Glu Asn 
95 

GGC TTC ACG 
Gly Phe Thr 
110 



CGG TTT 
Arg Phe 
65 

TCC CAG 
Ser Gin 
80 

TCG GTC 
Ser Val 



GTG CAG GTT 
Val Gin Val 

20 

AGC AAC GAT 
Ser Asn Asp 
35 

TCG GAC GGA 
Ser Asp Gly 
50 

CGC GTC TGC 
Arg Val Cys 



Lys Cys 
5 

CAC AGT 
His Ser 



Leu Leu Thr 



CAC GGG 
His Gly 



GCG ACG 
Ala Thr 



TGC ACC TAC 
Cys Thr Tyr 



AAT CTG ACC 
Asn Leu Thr 
100 



CTA AAG 
Leu Lys 
70 

GGG GAC 
Gly Asp 
85 

GAC GCC 
Asp Ala 



TCC GGC AGC 
Ser Gly Ser 
25 

CGG GAC AAC 
Arg Asp Asn 
40 

GGC AAG TGC 
Gly Lys Cys 

55 

CAC TAC CAG 
His Tyr Gin 



Ala 
10 

TTT 
Phe 



GAG 
Glu 



CTG 
Leu 



GCC 
Ala 



GTG ATC ACG CCC 
Val He Thr Pro 
90 

CAG CGC TTC CAG 
Gin Arg Phe Gin 
105 



AAT CCC 
Asn Pro 



CCG GGT 
Pro Gly 



AGC GGC 
Ser Gly 
140 

CAG CAG 
Gin Gin 
155 

TCG CAG 
Ser Gin 



AAC TAC 
Asn Tyr 

TCA TTT 
Ser Phe 



ACC TTC TCG 
Thr Phe Ser 
125 

AAT GCG CGA 
Asn Ala Arg 



CTG ATC 
Leu He 



GTA CTG GAG 
Val Leu Glu 



TAC ACG TCG 
Tyr Thr Ser 
175 

TAC GGA TCC 
Tyr Gly Ser 
190 



ACC AAC 
Thr Asn 
145 

GTG TCC 
Val Ser 
160 

CTG GAG 
Leu Glu 



ATC CAG TTC 
He Gin Phe 
115 

GTC GAG GCC 
Val Glu Ala 
130 

AAG CTC CTC 
Lys Leu Leu 



TCC GAA TGG 
Ser Glu Trp 



GGC TGT 
Gly Cys 



ACC GGA 
Thr Gly 
220 

TGT GAA 
Cys Glu 
235 

GGC TGG 
Gly Trp 



GGA CAC TCG 
Gly His Ser 
205 

TGG CAG GGC 
Trp Gin Gly 



ACT TGC 
Thr Cys 



TAC GAT TTC 
Tyr Asp Phe 
180 

GCC AAG TTC 
Ala Lys Phe 
195 

TCG GAG ACG 
Ser Glu Thr 
210 



CCC TTC TCG TTC TCA TGG 
Pro Phe Ser Phe Ser Trp 
120 

TGG CAT GAT ACG AAC AAT 
Trp His Asp Thr Asn Asn 
135 

ATC CAG CGA CTC TTG GTG 
lie Gin Arg Leu Leu Val 
150 

AAG ACG AAC AAG TCG GAA 
Lys Thr Asn Lys Ser Glu 
165 170 

CGT GTC ACC TGC GAT CTC 
Arg Val Thr Cys Asp Leu 
185 



CAT GGA CAT 
His Gly His 



AAG GGA GCC 
Lys Gly Ala 
255 



GAT TAC 
Asp Tyr 
225 

TGC GAC 
Cys Asp 
240 

TTG TGC 
Leu Cys 



TGT CAC ATA 
Cys His He 

AAA CCC AAT 
Lys Pro Asn 



AAC GAG TGC 
Asn Glu Cys 
260 



TGC CGG CCC CGC GAC GAT 
Cys Arg Pro Arg Asp Asp 
200 

GGC GAA ATT ATC TGT TTG 
Gly Glu He He Cys Leu 
215 

CCC AAA TGC GCC AAA GGC 
Pro Lys Cys Ala Lys Gly 
230 

CAA TGC GTT TGC CAA CTG 
Gin Cys Val Cys Gin Leu 
245 250 

GTT CTG GAA CCG AAC TGC 
Val Leu Glu Pro Asn Cys 
265 



219 



267 



315 



363 



411 



459 



507 



555 



603 



651 



699 



747 



795 



843 



891 



939 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 987 
He His Gly Thr Cys Asn Lys Pro Trp Thr Cys He Cys Asn Glu Gly 
270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 1035 
Trp Glv Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
285 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 1083 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1179 
Glu He Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn Gly Gly 
335 340 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 1227 
Thr Cys He Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys 
350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 1275 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Val Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 
Ser Asp Lys Pro Cys His Gin Gly He Cys Arg Asn Val Arg Pro Gly 
380 385 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro He Gly Tyr 
395 400 405 410 

AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 1419 
Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 
415 420 425 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 
Cys He Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys He Cys Pro 
430 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn He Asp Asp Cys Leu 
445 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 1563 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Val Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 1611 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 * 480 485 490 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 1659 
Lys Val Asp Leu Cys Leu He Arg Pro Cys Ala Asn Gly Gly Thr Cys 
495 500 505 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 1707 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 
510 515 520 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 1755 
Gly Lys Asp Cys Ser Val Asp He Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 535 
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SS K SS SS S 5S i SS S !5 2 2 2 SS 

540 545 

sssKssKKssssssxgsss^ 

555 560 

2 51? S S SS SS SS SS $ SS SS £ S SS 

575 580 

S 2 SS S £ 2 SS SS S3 SS 2 ffi S2 SS 

590 595 

S5 S 5 2 2 S3 SS S3 S SS SS 55 52 SS 

605 610 

ATG RAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG 
Het Sg Ss Arg Lys Arg Ala Gin ol« Lys Asp Asp Ala 

620 625 

s E ss 2 ss ss K s ss ss s s s ss 

2 SS SS SS SS SS SS SS 2 SS 2 S SS SS 

655 660 

GGC AGC AAC AGC GGT CTC ACC TTC CAT GGC GGC AAC CCG AAT 
Gly Ser £n Ser Gly Leu Thr Phe Asp Gly oly Asn Pro Asn 

670 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA 
JJi Sin Thr Trp Asp Lys ^er Val Asn Asn lie Cys Ala Ser 

685 °° 

ss ss ss ss ss ss ss ss ss ss ss pj SS ffi 

700 705 

CCA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA 
Si ?S Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser 

715 720 

2 ?5 SS 2 2 SS 2 SS SS 2 SS 2 SS SS 

* 735 740 

2 SS E i S 55 SS SS 2 SS SS SS 2 2 
Sg SS 2 SS SS? SS SS SS SS SS SS SS 5| SS 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG 
SS S G?5 Glu Gly ser Tyr Cys Ser Gin Arg Trp Pro Ser 

780 785 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT 
S S SJ Sal Ala Gly Ala Cys Ser Ser Gin Leu Met Ala 

795 800 



TGC GTG 
Cys Val 



TAC GAT 
Tyr Asp 
570 

GCG AGA 
Ala Arg 
585 

TTC TCC 
Phe Ser 



TTC TGC 
Phe Cys 



GAG GCC 
Glu Ala 



AAT GGC 
Asn Gly 
650 

AAA ACT 
Lys Thr 
665 

ATC ATC 
lie He 



GCA GCA 
Ala Ala 



TAC GGC 
Tyr Gly 



GAC TTT 
Asp Phe 
730 

AAC ACC 
Asn Thr 
745 

GCC AAG 
Ala Lys 



ATC TCT 
He Ser 



TTG GCG 
Leu Ala 



GCA GCT 
Ala Ala 
810 



1803 



1851 



1899 



1947 



1995 



2043 



2091 



2139 



2187 



2235 



2283 



2331 



2379 



2427 



2475 



2523 



2571 
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Phe Ser Asa Asp His Gly Arg Asp Asn Glu Gly Arg Cys Cys Ser Gly 
35 



40 45 



Glu Ser Asp Gly Ala Thr, Gly Lys Cys Leu Gly Ser Cys Lys Thr Arg 

50 55 
Phe Arg Val Cys Leu Lys His Tyr Gin Ala Thr lie Asp Thr Thr Ser 



65 



Gin Cys Thr Tyr Gly Asp Val He Thr Pro lie Leu Gly Glu Asn Ser 



85 



Val Asn Leu Thr Asp Ala Gin Arg Phe Gin Asn Lys Gly Phe Thr Asn 
100 105 
lie Gin Phe Pro Phe Ser Phe Ser Trp Pro Gly Thr Phe Ser Leu 



Pro 



He 



115 



120 



Val Glu Ala Trp His Asp Thr Asn Asn Ser Gly Asn Ala Arg Thr 



130 



135 



Asn Lys Leu Leu lie Gin Arg Leu Leu Val Gin Gin Val Leu Glu Val 
145 150 155 

Ser ser Glu Trp Lys Thr Asn Lys Ser Glu Ser Gin Tyr Thr Ser Leu 

165 170 

Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu Asn Tyr Tyr Gly Ser Gly 



180 



Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp Ser Phe Gly His Ser Thr 



195 



2619 



2670 



2892 



s s s s i s s s s s s: ss sr. s k ss 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 
Val Cys Gly Thr Pro His Met 
830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG AC AG C AC AT A CACAAAGAAA AGACTGGGTT 2730 
GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 
ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 
TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met His Trp He Lys Cys Leu Leu Thr Ala Phe lie Cys Phe Thr Val 
1 5 10 l * 

lie Val Gin Val His Ser Ser Gly Ser Phe Glu Leu Arg Leu Lys Tyr 
20 25 * 
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Cys ser Glu Thr Gly Glu lie lie Cys Leu Thr Gly Trp Gin Gly Asp 

Tyr Cys His He Pro Lys Cys Ala Lys Gly Cys Glu His Gly His Cys 

230 235 * HKJ 



225 



Asp Lys Pro Asn Gin Cys Val Cys Gin Leu Gly Trp Lys Gly Ala Leu 
245 250 

cys Asn Glu Cys Val Leu Glu Pro Asn Cys He His Gly Thr Cys Asn 
1 260 265 270 

Lys Pro Trp Thr Cys lie Cys Asn Glu Gly Trp Gly Gly Leu Tyr Cys 
275 280 285 

Asn Gin Asp Leu Asn Tyr Cys Thr Asn His Arg Pro Cys Lys Asn Gly 

290 295 300 

Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu Tyr Thr Cys Lys Cys Ala 
305 310 315 320 

Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn Glu lie Tyr Ser Cys Asp 
325 330 JJS 

Ala Asp Val Asn Pro Cys Gin Asn Gly Gly Thr Cys He Asp Glu Pro 
340 345 350 

His Thr Lys Thr Gly Tyr Lys Cys His Cys Ala Asn Gly Trp Ser Gly 
355 360 365 

Lvs Met Cys Glu Glu Lys Val Leu Thr Cys Ser Asp Lys Pro Cys His 
* 370 375 380 

Gin Gly He Cys Arg Asn Val Arg Pro Gly Leu Gly Ser Lys Gly Gin 

Glv Tyr Gin Cys Glu Cys Pro He Gly Tyr Ser Gly Pro Asn Cys Asp 
y y 405 410 415 

Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro Cys He Asn Gly Gly Ser 
420 425 430 

Cys Gin Pro Ser Gly Lys Cys He Cys Pro Ala Gly Phe Ser Gly Thr 
J 435 440 445 

Arq Cya Glu Thr Asn He Asp Asp Cys Leu Gly His Gin Cys Glu Asn 
450 455 460 

Gly Gly Thr Cys He Asp Met Val Asn Gin Tyr Arg Cys Gin Cys Val 
4 6 | 470 475 4BU 

Pro Gly Phe His Gly Thr His Cys Ser Ser Lys Val Asp Leu Cys Leu 
485 490 495 

He Arg Pro Cys Ala Asn Gly Gly Thr Cys Leu Asn Leu Asn Asn Asp 

500 505 51° 

Tvr Gin Cys Thr Cys Arg Ala Gly Phe Thr Gly Lys Asp Cys Ser Val 
1 sis 520 525 

Asp He Asp Glu Cys Ser Ser Gly Pro Cys His Asn Gly Gly Thr Cys 
* 530 535 540 



Met Asn Arg Val Asn Ser Phe Glu Cys Val Cys Ala Asn Gly Phe Arg 
Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp Ser Val Thr Phe Asp Ala 
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565 570 575 

His Gin Tyr Gly Ala Thr Thr Gin Ala Arg Ala Asp Gly Leu Thr Asn 
580 585 590 

Ala Gin Val Val Leu lie Ala Val Phe Ser Val Ala Met Pro Leu Val 
595 600 605 

Ala Val lie Ala Ala Cys Val Val Phe Cys Met Lys Arg Lys Arg Lys 
610 615 620 

Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala Arg Lys Gin Asn Glu Gin 
625 630 635 640 

Asn Ala Val Ala Thr Met His His Asn Gly Ser Gly Val Gly Val Ala 
645 650 655 

Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr Gly Ser Asn Ser Gly Leu 
660 665 670 

Thr Phe Asp Gly Gly Asn Pro Asn He lie Lys Asn Thr Trp Asp Lys 
675 680 685 

Ser Val Asn Asn He Cys Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala 
690 695 700 

Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly Gly Tyr Val Ala Ser Val 
705 710 715 720 

Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe Cys Val Ala Pro Leu Gin 
725 730 735 

Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr Asp Pro Thr Leu Met His 
740 745 750 

Ara Gly Ser Pro Ala Gly Ser Ser Ala Lys Gly Ala Ser Gly Gly Gly 
y 755 760 765 

Pro Gly Ala Ala Glu Gly Lys Arg lie Ser Val Leu Gly Glu Gly Ser 
770 775 780 

Tvr Cys Ser Gin Arg Trp Pro Ser Leu Ala Ala Ala Gly Val Ala Gly 
785 790 795 800 

Ala Cys Ser Ser Gin Leu Met Ala Ala Ala Ser Ala Ala Gly Ser Gly 
805 810 815 

Ala Gly Thr Ala Gin Gin Gin Arg Ser Val Val Cys Gly Thr Pro His 
820 825 830 



Met 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 442.. 1320 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGAGTCGAG CGCCGTGCTT CGAGCGGTGA TGAGCCCCTT TTCTGTCAAC GCTAAAGATC 60 

TACAAAACAT CAGCGCCTAT CAAGTGGAAG TGTCAAGTGT GAACAAAACA AAAACGAGAG 120 

AAGCACATAC TAAGGTCCAT ATAAATAATA AATAATAATT GTGTGTGATA ACAACATTAT 180 

CCAAACAAAA CCAAACAAAA CGAAGGCAAA GTGGAGAAAA TGATACAGCA TCCAGAGTAC 240 

GGCCGTTATT CAGCTATCCA GAGCAAGTGT AGTGTGGCAA AATAGAAACA AACAAAGGCA 300 

CCAAAATCTG CATACATGGG CTAATTAAGG CTGCCCAGCG AATTTACATT TGTGTGGTGC 360 

CAATCCAGAG TGAATCCGAA ACAAACTCCA TCTAGATCGC CAACCAGCAT CACGCTCGCA 420 

AACGCCCCCA GAATGTACAA A ATG TTT AGG AAA CAT TTT CGG CGA AAA CCA 471 

Met Phe Arg Lys His Phe Arg Arg Lys Pro 
15 10 

GCT ACG TCG TCG TCG TTG GAG TCA ACA ATA GAA TCA GCA GAC AGC CTG 519 
Ala Thr Ser Ser Ser Leu Glu Ser Thr lie Glu Ser Ala Asp Ser Leu 
15 20 25 

GGA ATG TCC AAG AAG ACG GCG ACA AAA AGG CAG CGT CCG AGG CAT CGG 567 
Gly Met ser Lys Lys Thr Ala Thr Lys Arg Gin Arg Pro Arg His Arg 
30 35 40 

GTA CCC AAA ATC GCG ACC CTG CCA TCG ACG ATC CGC GAT TGT CGA TCA 615 
Val Pro Lys He Ala Thr Leu Pro Ser Thr He Arg Asp Cys Arg Ser 
45 50 55 

TTA AAG TCT GCC TGC AAC TTA ATT GCT TTA ATT TTA ATA CTG TTA GTC 663 
Leu Lys Ser Ala Cys Asn Leu He Ala Leu He Leu He Leu Leu Val 
60 65 70 

CAT AAG ATA TCC GCA GCT GGT AAC TTC GAG CTG GAA ATA TTA GAA ATC 711 
His Lys He Ser Ala Ala Gly Asn Phe Glu Leu Glu He Leu Glu He 
75 80 85 90 

TCA AAT ACC AAC AGC CAT CTA CTC AAC GGC TAT TGC TGC GGC ATG CCA 759 
Ser Asn Thr Asn Ser His Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro 
95 100 105 

GCG GAA CTT AGG GCC ACC AAG ACG ATA GGC TGC TCG CCA TGC ACG ACG 807 
Ala Glu Leu Arg Ala Thr Lys Thr He Gly Cys Ser Pro Cys Thr Thr 
110 H5 120 

GCA TTC CGG CTG TGC CTG AAG GAG TAC CAG ACC ACG GAG CAG GGT GCC 855 
Ala Phe Arg Leu Cys Leu Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala 
125 130 135 

AGC ATA TCC ACG GGC TGT TCG TTT GGC AAC GCC ACC ACC AAG ATA CTG 903 
Ser He Ser Thr Gly Cys Ser Phe Gly Asn Ala Thr Thr Lys He Leu 
140 145 150 

GGT GGC TCC AGC TTT GTG CTC AGC GAT CCG GGT GTG GGA GCC ATT GTG 951 
Gly Gly Ser Ser Phe Val Leu Ser Asp Pro Gly Val Gly Ala lie Val 
155 160 165 170 

CTG CCC TTT ACG TTT CGT TGG ACG AAG TCG TTT ACG CTG ATA CTG CAG 999 
Leu Pro Phe Thr Phe Arg Trp Thr Lys Ser Phe Thr Leu He Leu Gin 
175 180 185 

GCG TTG GAT ATG TAC AAC ACA TCC TAT CCA GAT GCG GAG AGG TTA ATT 1047 
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Ala Leu Asp Met Tyr Asn Thr Ser Tyr Pro Asp Ala Glu Arg Leu He 
190 195 200 

GAG GAA ACA TCA TAC TCG GGC GTG ATA CTG CCG TCG CCG GAG TGG AAG 1095 
Glu Glu Thr Ser Tyr Ser Gly Val He Leu Pro Ser Pro Glu Trp Lys 
205 210 - 215 

ACG CTG GAC CAC ATC GGG CGG AAC GCG CGG ATC ACC TAC CGT GTC CGG 1143 
Thr Leu Asp His He Gly Arg Asn Ala Arg He Thr Tyr Arg Val Arg 
220 225 230 

GTG CAA TGC GCC GTT ACC TAC TAC AAC ACG ACC TGC ACG ACC TTC TGC 1191 
Val Gin Cys Ala Val Thr Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys 
235 240 245 250 

CGT CCG CGG GAC GAT CAG TTC GGT CAC TAC GCC TGC GGC TCC GAG GGT 1239 
Arg Pro Arg Asp Asp Gin Phe Gly His Tyr Ala Cys Gly Ser Glu Gly 
255 260 265 

CAG AAG CTC TGC CTG AAT GGC TGG CAG GGC GTC AAC TGC GAG GAG GCC 1287 
Gin Lys Leu Cys Leu Asn Gly Trp Gin Gly Val Asn Cys Glu Glu Ala 
270 275 280 

ATA TGC AAG GCG GGC TGC GAC CCC GTC CAC GGC 1320 
He Cys Lys Ala Gly Cys Asp Pro Val His Gly 
285 290 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Phe Arg Lys His Phe Arg Arg Lys Pro Ala Thr Ser Ser Ser Leu 
15 10 15 

Glu Ser Thr He Glu Ser Ala Asp Ser Leu Gly Met Ser Lys Lys Thr 
20 25 30 

Ala Thr Lys Arg Gin Arg Pro Arg His Arg Val Pro Lys He Ala Thr 
35 40 45 

Leu Pro Ser Thr He Arg Asp Cys Arg Ser Leu Lys Ser Ala Cys Asn 
50 55 60 

Leu He Ala Leu He Leu He Leu Leu Val His Lys He Ser Ala Ala 
65 70 75 80 

Gly Asn Phe Glu Leu Glu He Leu Glu He Ser Asn Thr Asn Ser His 
85 90 95 

Leu Leu Asn Gly Tyr Cys Cys Gly Met Pro Ala Glu Leu Arg Ala Thr 
100 105 110 

Lye Thr He Gly Cys Ser Pro Cys Thr Thr Ala Phe Arg Leu Cys Leu 
115 120 125 

Lys Glu Tyr Gin Thr Thr Glu Gin Gly Ala Ser He Ser Thr Gly Cys 
130 135 140 

Ser Phe Gly Asn Ala Thr Thr Lys He Leu Gly Gly Ser Ser Phe Val 
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145 



150 155 160 



Leu Ser Asp Pro Gly Val Gly Ala lie Val Leu Pro Phe Thr Phe Arg 

Trp Thr Lys Ser Phe Thr Leu lie Leu Gin Ala Leu Asp Met Tyr Asn 

180 185 

Thr ser Tyr Pro Asp Ala Glu Arg Leu lie Glu Glu Thr Ser Tyr Ser 

^95 200 ^ u 

Gly Val lie Leu Pro Ser Pro Glu Trp Lys Thr Leu Asp His lie Gly 

210 215 
Arg Asn Ala Arg He Thr Tyr Arg Val Arg Val Gin Cys Ala Val Thr 

Tyr Tyr Asn Thr Thr Cys Thr Thr Phe Cys Arg Pro Arg Asp Asp Gin 
245 250 

Phe Gly His Tyr Ala Cys Gly Ser Glu Gly Gin Lys Leu Cys Leu Asn 

260 265 

Gly Trp Gin Gly Val Asn Cys Glu Glu Ala lie Cys Lys Ala Gly Cys 
275 280 285 

Asp Pro Val His Gly 
290 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGGTGGACTT CCTTCGTGTA TTGGTGGGAG CCCTCGGGAA CGGGGGGTAA CACTGAAAGG 60 
TCGAGTACCC ATTTCCGTCA TAACGGGTTG GTCGCCCCCT AGGGGTCGGA GTCAGGTGGA 120 
CGGGAGGTCG ACAACGCCCG GGGGACGGGT GGTACATGGT GTAAGGTCTT TACCGGACCG 180 
GGCAAACGGG TCACACCGAA AGGGGTGAAC GGTAACTACG GGGTCGTCCT GCCCGTCCAT 240 
CGAGTCTGGT AAGAGGGTCG CCTTAAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



267 
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60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 
CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 120 
GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 180 
AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 240 
TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 300 
GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 360 
TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGNC CAGACAGACC GGACTGGTGA 
GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 
TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 
GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



420 
480 
540 
574 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 
ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 
TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 

CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 120 

GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 180 

CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 240 

AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 295 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



60 
120 
180 
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GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 
CAGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

TACGTATCTC GAGCACAGAC AGCTGACGTA CACTTTTNNA GTGCGAGGGA CATTCGTCCG 60 

ACCAGTACGA ACATTTAGGC TCAGTACGGT AGGTCCATGG CCAAGACTAG GAGACGTAGG 120 

GAGCTACAGG TCCCGCTCGC TAAACTCGGA CCACTGAAAC CTCCGGTCGA CAGTCGGTAA 180 

GCGAACAAGA GGGCCAGATC TTAGAGAAGG TGTCGCGGCG AGACTCGGGC TCGGGTCAGG 240 

CGGCCTTAAG GACGTCGGGC CCNNNAGGTG ATCAAGATCT CGNCNCGGCG GGCGCCACCT 300 

CGAGGNCGAA AACAAGGGAA ATC 323 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGC CAG GAG GAC GCG GGC AAC AAG GTC TGC AGC CTG CAG TGC AAC AAC 48 
Cvs Gin Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
1 5 10 15 

CAC GCG TGC GGC TGG GAC GGC GGT GAC TGC TCC CTC AAC TTC AAT GAC 96 
His Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 
20 25 30 

CCC TGG AAG AAC TGC ACG CAG TCT CTG CAG TGC TGG AAG TAC TTC AGT 144 
Pro Trp Lys Asn Cys Thr Gin Ser. Leu Gin Cys Trp Lys Tyr Phe Ser 
35 40 45 

GAC GGC CAC TGT GAC AGC CAG TGC AAC TCA GCC GGC TGC CTC TTC GAC 192 
Asp Gly His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 
50 55 60 

GGC TTT GAC TGC CAG CGT GCG GAA GGC CAG TGC AAC CCC CTG TAC GAC 240 
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Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 



65 



480 



CAG TAC TGC AAG GAC CAC TTC AGC GAC GGG CAC TGC GAC CAG GGC TGC 288 
Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 
85 90 « 

AAC AGC GCG GAG TGC GAG TGG GAC GGG CTG GAC TGT GCG GAG CAT OTA 336 
£S S« Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 
100 105 

CCC GAG AGG CTG GCG GCC GGC ACG CTG GTG GTG GTG GTG CTG ATG CCG 384 
Pro Su A?g Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 125 

CCG GAG CAG CTG CGC AAC AGC TCC TTC CAC TTC CTG CGG GAG CTC AGC 432 
Pro Su Sn Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 135 14 

CGC GTG CTG CAC ACC AAC GTG GTC TTC AAG CGT GAC GCA CAC GGC CAG 
Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 
14 5 150 155 160 

CAG ATG ATC TTC CCC TAC TAC GGC CGC GAG GAG GAG CTG CGC AAG CAC 528 
Gin Met lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 
165 170 1'5 

CCC ATC AAG CGT GCC GCC GAG GGC TGG GCC GCA CCT GAC GCC CTG CTG 576 
Pro lie Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 
180 185 190 

GGC CAG GTG AAG GCC TCG CTG CTC CCT GOT GGC AGC GAG GGT GGG CGG 624 
Gl5 SS S3 lVs Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg 
195 200 205 

CGG CGG AGG GAG CTG GAC CCC ATG GAC GTC CGC GGC TCC ATC GTC TAC 672 
Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr 
210 215 220 

CTG GAG ATT GAC AAC CGG CAG TGT GTG CAG GCC TCC TCG CAG TGC TTC 720 
Leu Glu lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 235 

CAG AGT GCC ACC GAC GTG GCC GCA TTC CTG GGA GCG CTC GCC TCG CTG 768 
Gin ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu 
245 250 " 5 

rrr RGC CTC AAC ATC CCC TAC AAG ATC GAG GCC GTG CAG AGT GAG ACC 816 
S? sir LeS ten lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr 
260 265 270 

GTG GAG CCG CCC CCG CCG GCG CAG CTG CAC TTC ATG TAC GTG GCG GCG 864 
Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 
275 280 285 

CCC GCC TTT GTG CTT CTG TTC TTC GTG GGC TGC GGG GTG CTG CTG TCC 912 
Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 
290 295 300 

CGC AAG CGC CGG CGG CAG CAT GGC CAG CTC TGG TTC CCT GAG GGC TTC 960 
Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 ' 310 315 J^O 

AAA GTG TCT GAG GCC AGC AAG AAG AAG CGG CGG GAG CCC CTC GGC GAG 1008 
iy^ Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 

325 330 J " 
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GAC TCC GTG GGC CTC AAG CCC CTG AAG AAC GCT TCA GAC GGT GCC CTC 1056 
Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 
340 345 350 

ATG GAC GAC AAC CAG AAT GAG TGG GGG GAC GAG GAC CTG GAG ACC AAG 1104 
Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 
355 360 365 

AAG TTC CGG TTC GAG GAG CCC GTG GTT CTG CCT GAC CTG GAC GAC CAG 1152 
Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 
370 375 380 

ACA GAC CAC CGG CAG TGG ACT CAG CAG CAC CTG GAT GCC GCT GAC CTG 1200 
Thr Asp His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu 
385 390 395 400 

CGC ATG TCT GCC ATG GCC CCC ACA CCG CCC CAG GGT GAG GTT GAC GCC 1248 
Arq Met Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala 
405 410 415 

GAC TGC ATG GAC GTC AAT GTC CGC GGG CCT GAT GGC TTC ACC CCG CTC 1296 
Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 
420 425 430 

ATG ATC GCC TCC TGC AGC GGG GGC GGC CTG GAG ACG GGC AAC AGC GAG 1344 
Met He Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu 
435 440 445 

GAA GAG GAG GAC GCG CCG GCC GTC ATC TCC GAC TTC ATC TAC CAG GGC 1392 
Glu Glu Glu Asp Ala Pro Ala Val He Ser Asp Phe He Tyr Gin Gly 
450 455 460 

GCC AGC CTG CAC AAC CAG ACA GAC CGC ACG GGC GAG ACC GCC TTG CAC 1440 
Ala Ser Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His 
465 470 475 480 

CTG GCC GCC CGC TAC TCA CGC TCT GAT GCC GCC AAG CGC CTG CTG GAG 1488 
Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 
485 490 495 

GCC AGC GCA GAT GCC AAC ATC CAG GAC AAC ATG GGC CGC ACC CCG CTG 1536 
Ala Ser Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu 
500 505 510 

CAT GCG GCT GTG TCT GCC GAC GCA CAA GGT GTC TTC CAG ATC CTG ATC 1584 
His Ala Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He 
515 520 525 

CGG AAC CGA GCC ACA GAC CTG GAT GCC CGC ATG CAT GAT GGC ACG ACG 1632 
Arg Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 
530 535 540 

CCA CTG ATC CTG GCT GCC CGC CTG GCC GTG GAG GGC ATG CTG GAG GAC 1680 
Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp 
545 550 555 560- 

CTC ATC AAC TCA CAC GCC GAC GTC AAC GCC GTA GAT GAC CTG GGC AAG 1728 
Leu He Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys 
565 570 575 

TCC GCC CTG CAC TGG GCC GCC GCC GTG AAC AAT GTG GAT GCC GCA GTT 1776 
Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 
580 585 590 

GTG CTC CTG AAG AAC GGG GCT AAC AAA GAT ATG CAG AAC AAC AGG GAG 1824 
Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu 
595 600 605 
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GAG ACA CCC CTG TTT CTG GCC GCC CGG GAG GGC AGC TAC GAG ACC GCC 1872 
Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 
610 615 620 

AAG GTG CTG CTG GAC CAC TTT GCC AAC CGG GAC ATC ACG GAT CAT ATG 1920 
Lvs Val Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met 
625 630 635 640 

GAC CGC CTG CCG CGC GAC ATC GCA CAG GAG CGC ATG CAT CAC GAC ATC 1968 
Abp Arg Leu Pro Arg Asp He Ala Gin Glu Arg Met His His Asp He 
645 650 655 

GTG AGG CTG CTG GAC GAG TAC AAC CTG GTG CGC AGC CCG CAG CTG CAC 2016 
Val Ara Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His 
660 665 670 

GGA GCC CCG CTG GGG GGC ACG CCC ACC CTG TCG CCC CCG CTC TGC TCG 2064 
Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 
675 680 685 

CCC AAC GGC TAC CTG GGC AGC CTC AAG CCC GGC GTG CAG GGC AAG AAG 2112 
Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys 
690 695 700 

GTC CGC AAG CCC AGC AGC AAA GGC CTG GCC TGT GGA AGC AAG GAG GCC 2160 
Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 
705 710 715 720 

AAG GAC CTC AAG GCA CGG AGG AAG AAG TCC CAG GAT GGC AAG GGC TGC 2208 
Lvs Asp Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys 
* 725 730 735 

CTG CTG GAC AGC TCC GGC ATG CTC TCG CCC GTG GAC TCC CTG GAG TCA 2256 
Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 
740 745 750 

CCC CAT GGC TAC CTG TCA GAC GTG GCC TCG CCG CCA CTG CTG CCC TCC 2304 
Pro His Gly Tyr Leu ServAsp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 765 

CCG TTC CAG CAG TCT CCG TCC GTG CCC CTC AAC CAC CTG CCT GGG ATG 2352 
Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 
770 775 780 

CCC GAC ACC CAC CTG GGC ATC GGG CAC CTG AAC GTG GCG GCC AAG CCC 2400 
Pro Asp Thr His Leu Gly He Gly His Leu Asn Val Ala Ala Lys Pro 
785 . 790 795 800 

GAG ATG GCG GCG CTG GGT GGG GGC GGC CGG CTG GCC TTT GAG ACT GGC 2448 
Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 
805 810 815 

CCA CCT CGT CTC TCC CAC CTG CCT GTG GCC TCT GGC ACC AGC ACC GTC 2496 
Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 
820 825 830 

CTG GGC TCC AGC AGC GGA GGG GCC CTG AAT TTC ACT GTG GGC GGG TCC 2544 
Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 
835 840 845 

ACC AGT TTG AAT GGT CAA TGC GAG TGG CTG TCC CGG CTG CAG AGC GGC 2592 
Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

ATG GTG CCG AAC CAA TAC AAC CCT CTG CGG GGG AGT GTG GCA CCA GGC 2640 
Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 .880 
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CCC CTG AGC ACA CAG GCC CCC TCC CTG CAG CAT GGC ATG GTA GGC CCG 
Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 
885 890 895 

CTG CAC AGT AGC CTT GCT GCC AGC GCC CTG TCC CAG ATG ATG AGC TAC 
Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 
900 905 910 

CAG GGC CTG CCC AGC ACC CGG CTG GCC ACC CAG CCT CAC CTG GTG CAG 
Gin Glv Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

ACC CAG CAG GTG CAG CCA CAA AAC TTA CAG ATG CAG CAG CAG AAC CTG 
Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 935 940 

CAG CCA GCA AAC ATC CAG CAG CAG CAA AGC CTG CAG CCG CCA CCA CCA 
Gin Pro Ala Asn He Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 960 

CCA CCA CAG CCG CAC CTT GGC GTG AGC TCA GCA GCC AGC GGC CAC CTG 
Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 
965 970 975 

GGC CGG AGC TTC CTG AGT GGA GAG CCG AGC CAG GCA GAC GTG CAG CCA 
Glv Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 
980 985 990 

CTG GGC CCC AGC AGC CTG GCG GTG CAC ACT ATT CTG CCC CAG GAG AGC 
Leu Gly Pro Ser Ser Leu Ala Val His Thr He Leu Pro Gin Glu Ser 
995 1000 1005 

CCC GCC CTG CCC ACG TCG CTG CCA TCC TCG CTG GTC CCA CCC GTG ACC 
Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

GCA GCC CAG TTC CTG ACG CCC CCC TCG CAG CAC AGC TAC TCC TCG CCT 
Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro 
1025 1030 1035 1040 

GTG GAC AAC ACC CCC AGC CAC CAG CTA CAG GTG CCT GTT CCT GTA ATG 
Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 
1045 1050 1055 

GTA ATG ATC CGA TCT TCG GAT CCT TCT AAA GGC TCA TCA ATT TTG ATC 
Val Met He Arg Ser Ser Asp Pro Ser Lys Gly Ser Ser He Leu He 
1060 1065 1070 

GAA GCT CCC GAC TCA TGG 
Glu Ala Pro Asp Ser Trp 
1075 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3234 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Gin Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn 
J l 5 10 15 



PCI7US93/09338 

WO 94/07474 "~ 

-105- 

Hia Ala Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp 
20 25 

Pro Trp Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser 
35 40 

As p Gly His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp 
* 50 55 60 

Gly Phe Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp 

65 70 75 

Gin Tyr Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys 

Asn Ser Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val 
100 105 

Pro Glu Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro 
115 120 "a 

Pro Glu Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser 
130 I 35 14 

Arg Val Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin 

145 150 I 55 

Gin Met He Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His 
165 170 110 

Pro He Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu 

Gly Gin Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg 
195 200 20:> 

Arg Arg Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr 

210 215 
Leu Glu lie Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe 
225 230 

Gin Ser Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu 

245 250 
Gly Ser Leu Asn lie Pro Tyr Lys lie Glu Ala Val Gin Ser Glu Thr 

Val Glu Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala 

275 280 2 

Ala Ala Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser 

290 295 300 

Arg Lys Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe 
305 310 

Lys Val Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu 



325 



Asp Ser Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu 

Met Asp Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys 

355 360 
Lys Phe Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin 
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-106- 

370 375 380 



Thr Asp His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu 
385 390 395 400 

Ara Met Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala 
A y 405 410 415 

Asp Cys Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu 
420 425 4 

Met He Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu 
435 440 445 

Glu Glu Glu Asp Ala Pro Ala Val He Ser Asp Phe lie Tyr Gin Gly 
450 455 460 

Ala Ser Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His 
465 470 475 480 

Leu Ala Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu 
485 490 495 

Ala Ser Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu 
500 505 510 

His Ala Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He 
515 520 525 

Arq Asn Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr 
530 535 540 

Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp 
545 550 555 560 

Leu lie Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys 
565 570 575 

Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val 
580 585 590 

Val Leu Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu 
595 600 605 

Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala 
610 615 620 

Lys Val Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met 
625 630 635 640 

Asp Arg Leu Pro Arg Asp He Ala Gin Glu Arg Met His His Asp He 
645 650 655 

Val Arg Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His 
660 665 670 

Gly Ala Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser 
675 680 685 

Pro Asn Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys 
690 695 700 

Val Arg Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala 
705 710 715 720 

Lys Asp Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys 
1 * 725 730 735 
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Leu Leu Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser 
740 745 

Pro His Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser 
755 760 

Pro Phe Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met 

770 775 780 

Pro Asp Thr His Leu Gly He Gly His Leu Asn Val Ala Ala Lys Pro 
785 790 795 ouu 

Glu Met Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly 

805 810 

Pro Pro Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val 
825 OJU 



820 



Leu Gly Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser 



835 



Thr Ser Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly 
850 855 860 

Met Val Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly 
865 870 875 

Pro Leu Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro 
885 890 895 

Leu His Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr 
900 905 910 

Gin Gly Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin 
915 920 925 

Thr Gin Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu 
930 935 940 

Gin Pro Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro 
945 950 955 

Pro Pro Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu 
965 970 

Gly Arg Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro 
* 980 985 990 

Leu Gly Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser 

995 1000 

Pro Ala Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr 
1010 1015 1020 

Ala Ala Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro 
1025 1030 1035 

Val Asp Asn Thr Pro Ser His Gin Leu Gin Val Pro Val Pro Val Met 

1045 1050 

Val Met lie Arg Ser Ser Asp Pro Ser Lys Gly Ser ser lie Leu lie 

1060 1065 

Glu Ala Pro Asp Ser Trp 
1075 

(2) INFORMATION FOR SEQ ID NO: 12: 
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-108- 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 1972 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 46 
Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys 
15 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 94 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 
20 25 30 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 142 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn lie lie Thr Asp 
35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 190 
Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 
50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 238 
Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 286 
Lvs Arq Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 334 
Glv Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 
100 105 110 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 382 
Phe Gin lie Leu lie Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 
115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 430 
Asn Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 

GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 478 
Gly Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 526 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 574 
Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met 
180 185 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 622 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly 
195 200 205 
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AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 670 
Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Arg Asp 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 718 
He Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 
225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 766 
Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 814 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys 
260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 862 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys 
275 280 285 

AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 
Lvs Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
3 290 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 
Ser Leu Ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 
Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 
340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 
Ser Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 
355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala Gin His 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 
Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser His His HiB 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 
He Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 
420 425 430 

CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 
Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 
435 440 445 

ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 
Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly 
450 455 460 

ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438 
Thr His Pro Gly He Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 
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ATA 
He 
480 


ACC 
Thr 


ACC 
Thr 


CCT 
Pro 


CGG 
Arg 


GAG 
Glu 
485 


CCC 
Pro 


TTG 
Leu 


CCC 
Pro 


CCC 
Pro 


ATT 
I le 
490 


GTG 
val 


ACT 
Tnr 


TTC 
rfie 


CAG 
win 


CTC 
Leu 
495 


ATC 
He 


CCT 
Pro 


AAA 
Lys 


GGC 
Gly 


AGT 
Ser 
500 


ATT 
He 


GCC 
Ala 


CAA 
Gin 


CCA 
Pro 


GCG 
Ala 
505 


GGG 
Gly 


GCT 
Aia 


CCC 
Pro 


CAG 


CCT 
fiO 

510 


CAG 

Pin 

win 


TCC 
Ser 


ACC 
Thr 


TGC 
Cys 


CCT 
Pro 
515 


CCA 
Pro 


GCT 
Ala 


GTT 
Val 


GCG GGC 
Ala Gly 
520 


CCC 
Pro 


CTG 
Leu 


CCC 
Pro 


ACC 

rnU 

i nr 


ATG 

neb 

525 


TAC 
Tyr 


CAG 

fit n 


ATT 
He 


CCA 
Pro 


GAA 
Glu 
530 


ATG 
Met 


GCC 
Ala 


CGT 
Arg 


TTG 
Leu 


CCC 
Pro 
535 


AGT 
Ser 


GTG 
vai 


GCT 

Aid 


TTC 
x*ne 


CCC 
Pro 
540 


ACT 

Thy 


GCC 

nla 


ATG 


ATG 
Met 


CCC 
Pro 
545 


CAG 
Gin 


CAG 
Gin 


GAC 
Asp 


GGG 
Gly 


CAG 
Gin 
550 


GTA 
Val 


GCT 
Ala 


CAG 
Gin 


ACC 
Thr 


ATT 

T 1 _ 

He 
555 


CTC 
Leu 


CCA 
Pro 


GCC 
Aia 


TAT 

Tyr 


CAT 
His 
560 


CCT 
Pro 


TTC 
Phe 


CCA 
Pro 


GCC 
Ala 


TCT 
Ser 
565 


GTG 
Val 


GGC AAG 
Gly Lys 


TAC 
Tyr 


CCC 
Pro 
570 


ACA 
Thr 


CCC 
Pro 


CCT 
Pro 


TCA 
Ser 


CAG 
Gin 
575 


CAC 
His 


AGT 
Ser 


TAT 
Tyr 


GCT 
Ala 


TCC 
Ser 
580 


TCA 
Ser 


AAT 
Asn 


GCT 
Ala 


GCT 
Ala 


GAG 
Glu 
585 


CGA 
Arg 


ACA 
Thr 


CCC 
pro 


AGT 
ser 


CAC 
nlS 
590 


AGT 
ser 


GGT 
Gly 


CAC 
His 


CTC 
Leu 


CAG 
Gin 
595 


GGT 
Gly 


GAG 
Glu 


CAT 
His 


CCC 
Pro 


TAC 
Tyr 
600 


CTG 
Leu 


ACA 
Thr 


CCA 
Pro 


TCC 

C A V 


CCA 
Pro 
605 


GAG 

<j1U 


TCT 
ser 


CCT 
Pro 


GAC 
Asp 


CAG 
Gin 
610 


TGG 
Trp 


TCA 
Ser 


AGT 
Ser 


TCA 
Ser 


TCA 
Ser 
615 


CCC 
Pro 


CAC 
His 


TCT 
Ser 


GCT 
Ala 


TCT 
Ser 
620 


GAC 
Asp 


TGG 
Trp 


TCA 
Ser 


GAT 
Asp 


GTG 
Val 
625 


ACC 
Thr 


ACC 
Thr 


AGC 
Ser 


CCT 
Pro 


ACC 
^hr 
630 


CCT 
Pro 


GGG 
Gly 


GGT 
Gly 


GCT 
Ala 


GGA 
Gly 
635 


GGA 
Gly 


GGT 
Gly 


CAG 
Gin 


CGG 
Arg 


GGA 
Gly 
640 


CCT 
Pro 


GGG ACA 
Gly Thr 


CAC 
His 


ATG 
Met 
645 


TCT 
Ser 


GAG 
Glu 


CCA 
Pro 


CCA 
Pro 


CAC 
His 
650 


AAC 
Asn 


AAC 
Asn 


ATG 
Met 


CAG 
Gin 


GTT 
Val 
655 



1486 



1534 



1582 



1630 



1678 



1726 



1774 



1822 



1870 



1918 



1966 



TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr Ala 



GCTGAGGAAC 


AAATGAAGGT 


CATCCGGGAG 


AGAAATGAAG 


AAATCTCTGG 


AGCCAGCTTC 


2082 


TAGAGGTAGG 


AAAGAGAAGA 


TGTTCTTATT 


CAGATAATGC 


AAG AG AAG C A 


ATTCGTCAGT 


2142 


TTCACTGGGT 


ATCTGCAAGG 


CTTATTGATT 


ATTCTAATCT 


AATAAGACAA 


GTTTGTGGAA 


2202 


ATGCAAGATG 


AATACAAGCC 


TTGGGTCCAT 


GTTTACTCTC 


TTCTATTTGG 


AGAATAAGAT 


2262 


GGATGCTTAT 


TGAAGCCCAG 


ACATTCTTGC 


AGCTTGGACT 


GCATTTTAAG 


CCCTGCAGGC 


2322 


TTCTGCCATA 


TCCATGAGAA 


GATTCTACAC 


TAGCGTCCTG 


TTGGGAATTA 


TGCCCTGGAA 


2382 


TTCTGCCTGA 


ATTGACCTAC 


GCATCTCCTC 


CTCCTTGGAC 


ATTCTTTTGT 


CTTCATTTGG 


2442 


TGCTTTTGGT 


TTTGCACCTC 


TCCGTGATTG 


TAGCCCTACC 


AGCATGTTAT 


AGGGCAAGAC 


2502 


CTTTGTGCTT 


TTGATCATTC 


TGGCCCATGA 


AAGCAACTTT 


GGTCTCCTTT 


CCCCTCCTGT 


2562 



CTTCCCGGTA TCCCTTGGAG TCTCACAAGG TTTACTTTGG TATGGTTCTC AGCACAAACC 2622 
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TTTCAAGTAT 
CATTCCTGGA 
ATCCCTTCAA 
TTACCAGGAA 
GTTTTATCCT 
GTTTGGAATG 
CACAGGTCCC 
CCCAGATCTG 
AACTGTTGAG 
CCATTTAGGA 
CAGATGATCA 
TCACACCGTG 
TTATATGTTC 
CTATAAGAAG 
TTCTTTTCCT 
AACATTTTCC 
TTGTTCAGAC 
TCTTTGACTT 
CTTCCTCTTC 
AGTGCTGTGA 
AACAGGTGAA 
AAGTATGCAT 
CCTGCGTGTC 
TGAACCAACA 
TGCTTTATTC 
TATCATTCTA 
ACCTATCTGC 
GCGATGGCGA 



GTTGTTTCTT 
GAGAGAAGGG 
GAGGCTGCAC 
GAAGGGTGTG 
TGATAGTCTA 
TTGGAATGAC 
CCTACTTCCT 
AGCATTCTAG 
CCTTTCCTTT 
CTGAACTTTC 
GTCTCTACTG 
TGGTCCGTGT 
AAGTGCAGGA 
GTTGGAAAAG 
CTGAAGCGGC 
TTTCCTAGAG 
TGATTGCCCC 
TCTTTTAAGG 
TCCTTGGGCC 
TGCCCATGAC 
TTCCCGACTC 
CAGTGCTTCC 
TGTTGGCATA 
AAAATAATTA 
TCTCCATGTG 
AATGGTGACT 
ATGGACCCTC 
TGACTTTCTT 



TGGAAAATGG 
GAGAAGAATA 
CTTAATTTTT 
AGTTTGTTGT 
GTTACTATGA 
CAAGAGACAA 
GCCAAGCATT 
GCCTGTTTCA 
CATATCCACA 
CTTAGCCCAA 
ATTATCTTGC 
TACTGGTATA 
ATTGGAAAGT 
GAGGAACTAT 
CATGACATTC 
TCACCTTTTA 
TCACCTGAAT 
GCAGAAGCAT 
AGTTAATAAT 
ACCTGCAAAA 
TTTTGGTTTG 
CACTTACCTG 
ATAGTTTACA 
CTTCTGCCCT 
GCAACATTCT 
CTCTGCCCTT 
ACCATCCTCT 
CCCCTG 



-111- 
ACATACTGTA 
CTTTTCTTCA 
CTTGTCTGTG 
TTTTCTGTGT 
CCCTCCCCAC 
GTTAACTCGT 
CCATTGACTG 
CTCACTCACC 
GAAGACACTG 
GGGACCCAGT 
TGCTTAAAGG 
CCCAGTATGT 
TGGACTTGTT 
ATAGCAGCCT 
CCTTTGGCAA 
GATGATAATG 
CCACTCTCTG 
TTTAGTTAAT 
TGGTCCATGG 
TAAGTTCTGC 
AATGACAGTT 
ATTTGTCTGT 
AATGGTTTTT 
GAGATAAGCA 
GTCAGCCTCT 
GGACCCATTT 
GTGCAGCACA 



TTGTGTTCTC 
ACAAATTTTG 
TGCAGGTCTT 
ATGGGCCTGG 
TTTTTTAAAA 
GCAAGAGCCA 
CCTGTATGGA 
CAGCATATGA 
TCTCAAATGT 
GACAGTTGTC 
CCTGCTCACC 
TCTCACTGAA 
TTCTATGATC 
TTGCTATTTT 
CTAACGTAGA 
GACAACTATA 
TATTCATGCT 
TGTAGATAAA 
CTACACTGCA 
CTGGGCATTT 
CTCATTCCTT 
CGGTGGCCCC 
TCAGTCCTAT 
GATTAAGTTT 
TTCATAGTGT 
ATTATTCACA 
CACAGTGCAG 



CTGCATATAT 
GGGGCAGGAG 
CATATAAACT 
TCAGTGTAAA 
CCAGAAAAAG 
GTTACCCACC 
ACACATTTGT 
AACTAGTCTT 
TGTACCCTTG 
TTCCGTTTGT 
AATCTTTCTT 
GACATGGACT 
CAAAACAGCC 
CTGCTACCAT 
AACTCAACAG 
GACTTGCTCA 
CTTGGCAATT 
GAATAGTTTT 
ACTTCCGTCC 
TG TAG AT ATT 
CTATGGCTGC 
ATATGGAAAC 
CCAAATTTAT 
GTTCATTCTC 
GCAAACATTT 
GATGGGGAGA 
GGAGCCAGTG 



2682 

2742 

2802 

2862 

2922 

2982 

3042 

3102 

3162 

3222 

3282 

3342 

3402 

3462 

3522 

3582 

3642 

3702 

3762 

3822 

3882 

3942 

4002 

4062 

4122 

4182 

4242 

4268 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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Glu Val Asp Val Leu Asp Val Asn Val Arg Gly Pro As P Cly Cys Thr 
Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp 



20 



Met Ala Leu His Leu Aia ma «ry — " * 80 

70 10 



Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn lie lie Thr Asp Leu 

val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly Glu 
50 55 

His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys 

65 

Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met Gly 

* 85 90 

Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val Phe 

100 105 

Gin lie Leu lie Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met Asn 

115 120 " s 

Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg Leu Ala Val Glu Gly 

Val 

145 150 155 "° 



130 135 
Met Val Ala Glu Leu lie Asn Cys Gin Ala Asp Val Asn Ala Val Asp 



Asp His Cly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn Val 

* ific 170 



165 

Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met Gin 
180 185 

Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser 
r 195 200 ^ Ub 

Tyr Glu Ala Ala Lys lie Leu Leu Asp His Phe Ala Asn Arg Asp lie 

210 215 
Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met 

225 230 

His His Asp lie Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser 



245 



Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val lie Cys Gly 

260 265 
Pro Asn Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys 

Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn 



275 2B0 285 

Pro Ser Ala 
290 295 
Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly ser Arg Arg Lys Lys Ser 
305 31° J 

Leu ser Glu Lys Val Gin Leu Ser Glu Ser Ser Val Thr Leu ser Pro 

325 JJU 

Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val ser Asp Thr Thr Ser 

340 345 
Ser Pro Met lie Thr Ser Pro Gly He Leu Gin Ala ser Pro Asn Pro 
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365 



„et Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala cin His Ala 

370 37b 

uio rlu Met Gin Pro Leu Ala His Gly Ala 
Leu Ser Phe Ser Asn Leu His Glu Met cm 4Q0 

385 390 

Se r Thr Val Leu Pro Ser Val Ser Cin Leu Leu Ser His His His II. 

Val ser Pro Gly L Oly ser Ala Gly Ser Leu Ser Arg Leu His Pro 

420 

v., Pro v,l n. »U »P Trp » f » «, »« «. va » «. * 

435 

* n« Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly Thr 
Gin Tyr Asn Glu Met pne oiy nc ^ 

450 4 " 
His Pro Gly He Ala Pro Cin Ser Arg Pro Pro Clu Cly Lys His lie 
465 470 

Th r Thr Pro Arg Glu Pro Leu Pro Pro lie Val Thr Phe Gin Leu lie 

Pro Lys Gly Ser lie Ala Cin Pro. Ala Cly Ala Pro Gin Pro Gin Ser 

500 

» &i» val Ala Gly Pro Leu Pro Thr Met Tyr Gin lie 
Thr Cys Pro Pro Ala Val Ala t^y g25 

515 

Pro Clu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 

530 535 
Pro Gin Gin Asp Cly Cin Val Ala Cin Thr lie Leu Pro Ala Tyr His 
545 550 

Pro Phe Pro Ala Ser Val Cly Lys Tyr Pro Thr Pro Pro Ser Cin Hxs 

565 

ser Tyr Ala Ser Ser Asn Ala Ala Clu Arg Thr Pro Ser HiS Q Ser Cly 

580 bttb 
His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Clu Ser Pro 

595 wv 

MP «, se, _ M s„ «. «. - 1. g -» «» - *"> 

610 615 
val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Cly cly cm Arg Gly 

625 630 

Pra ely a, Hi. »« s„ u. «. « S ; «-» « »« «" Si T 



Ala 



(2) INFORMATION FOR SEQ ID NO: 14: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 77 amino acids 

(B) TYPE: amino acid 

C STRANPEDNESS: single 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

01. Asp lie Asp 01. Cys Asp .in oly - Pro Cys oiu His Asn Gly 

n. Cys Val Asn Tnr Pro Oly Ser Tyr Arg Cys Asn Cys Ser Oln Oly 

20 " 
Ph e Thr Oly Pro Arg Cys Olu Thr Asn He Asn Ol« Cys Olu Ser His 

Pro cys l Asn Olu Oly Ser Cys Leu Asp Asp Pro Oly Thr Phe Arg 

50 55 
B Val Cy S Met Pro Gly Phe Thr Gly Thr Gin Cy S Glu 



Cy 
65 

2) INFORMATION FOR SEQ ID NO: 15: 



(2) 



tl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Asp Val Asp Glu Cys Ser Leu Gly Ala Asn Pro Cys Glu His Gly 

Oly Arg Cys Thr Asn Thr Leu Gly Ser Phe Gin Cys Asn Cys Pro Gin 
20 " 



Gly Tyr Ala Oly Pro Arg Cys Olu lie Asp Val Asn Olu Cys Leu Ser 

A6 „ Pro cys Oln Asn Asp Ser Thr Cys Leu Asp Oln He Oly Olu Phe 

50 55 
Gin Cys lie Cys Met Pro Gly Tyr Glu Gly Leu Tyr Cys Glu 



65 70 



(2) INFORMATION FOR SEQ ID NO: 16: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 amino acids 
/B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
Thr Pro Pro Gin Gly Glu lie Glu Ala Asp Cys Met Asp Val Asn Val 
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! 5 10 « 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Met He Ala Ser Cys Ser Gly 

20 25 
Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Ser Ala 



35 40 

lie Ser Asp Phe lie Gly Gin Gly Ala Gin Leu His Asn Gin 



Asn Met hb «*-r- -- - 6 q 

50 55 



Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ala 
65 70 75 

Arg Ala Asp Ala Ala Lys Arg Leu Leu Glu Ser Ser Ala Asp Ala Asn 



Val Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ala Ala 

100 105 
AS p Ala Gin Gly Val Phe Gin lie Leu lie Arg Asn Arg Ala Thr Asp 



115 



Leu 



Asp Ala Arg Met Phe Asp Gly Thr Thr Pro Leu lie Leu Ala Ala 
130 135 

A rg Leu Ala Val Glu Gly Met Val Glu Glu Leu lie Asn Ala His Ala 
14 5 150 155 

Asp Val Asn Ala Val Asp Glu Phe Gly Lys Ser Ala Leu His Trp Ala 

r 165 170 

Ala Ala Val Asn Asn Val Asp Ala Ala Ala Val Leu Leu Lys Asn Ser 

Ser 

195 200 205 



180 185 
Ala Asn Lys Asp Met Gin Asn Asn Lys Glu Glu Thr Ser Leu Phe Leu 



Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His 

Arg 

lie Ala Gin Glu Arg Met His His Asp lie Val His Leu Leu Asp Glu 



210 215 
Tyr Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp 

225 230 



245 



Tyr Asn Leu Val Lys Ser Pro Thr Leu His Asn Gly Pro Leu Gly Ala 

260 265 
Thr Thr Leu Ser Pro Pro He Cys Ser Pro Asn Gly Tyr Met Gly Asn 

275 2 ^ 
Met Lys Pro Ser Val Gin Ser Lys Lys Ala Arg Lys Pro Ser lie Lys 



290 



295 



ly Asn Gly Cys Lys Glu Ala Lys Glu Leu Lys Ala Arg Arg Lys Lys 



G 

305 



310 



Ser Gin Asp Gly Lys. Thr Thr Leu Leu Asp Ser Gly Ser Ser Gly Val 
Leu Ser Pro Val Asp Ser Leu Glu Ser Thr His Gly Tyr Leu Ser Asp 



340 345 

355 



Val Ser Ser Pro Pro Leu Met Thr Ser Pro Phe Gin Gin Ser Pro Ser 
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Het Pro Leu Asn His Leu Thr Ser Met Pro Glu Ser Gin Leu Gly Met 



370 



375 



Asn His lie Asn Met Ala Thr Lys Gin Glu Met Ala Ala Gly Ser Asn 
385 390 395 

Ar g Met Ala Phe Asp Ala Met Val Pro Arg Leu Thr Hi B Leu Asn Ala 

Ser Ser Pro Asn Thr lie Met Ser Asn Gly Ser Met His Phe Thr Val 
420 425 

Gly Gly Ala Pro Thr Met Asn Ser Gin Cys Asp Trp Leu Ala Arg Leu 

435 440 
Gin Asn Gly Met Val Gin Asn Gin Tyr Asp Pro lie Arg Asn Gly lie 

450 455 
Gin Gin Gly Asn Ala Gin Gin Ala Gin Ala Leu Gin His Gly Leu Met 
465 470 

Thr Ser Leu His Asn Gly Leu Pro Ala Thr Thr Leu Ser Gin Met Met 
485 490 

Thr Tyr Gin Ala Met Pro Asn Thr Arg Leu Ala Asn Gin Pro His Leu 
500 505 

Met Gin Ala Gin Gin Met Gin Gin Gin Gin Asn Leu Gin Leu His Gin 
515 520 525 

Ser Met Gin Gin Gin His His Asn Ser Ser Thr Thr Ser Thr His lie 
530 535 540 

Asn Ser Pro Phe Cys Ser Ser Asp lie Ser Gin Thr Asp Leu Gin Gin 
545 550 6S:> 

Met Ser Ser Asn Asn He His Ser Val Met Pro Gin Asp Thr Gin lie 
565 570 =>' = 

Phe Ala Ala Ser Leu Pro Ser Asn Leu Thr Gin Ser Met Thr Thr Ala 

580 585 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Met Asp 
595 600 &U5 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Asp His Pro Phe Leu Thr 

610 615 620 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser 
625 630 

Asn Met Ser Asp Trp Ser Glu Gly He Ser Ser Pro Pro Thr 
645 650 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: peptide 



xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



* • 
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Thr Pro Pro Gin Gly Glu Val Asp Ala Asp Cys Met Asp Val Asn Val 

Arg Gly Pro Asp Gly Phe Thr Pro Leu Het He Ala Ser Cys Ser Gly 

20 " 
Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu Glu Glu Asp Ala Pro Ala 

35 40 
val lie ser Asp Phe lie Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 



50 



55 



Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arc Tyr Ser Arg 

Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 

85 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 

100 

Ala Gin Gly Val Phe Gin He Leu Leu Arg Asn Arg Ala Thr Asp Leu 

115 120 
Asp Ala Arg Met His Asp Gly Thr Thr Pro Leu lie Leu Ala Ala Arg 

130 135 
Leu Ala Val Glu Gly Met Leu Glu Asp Leu lie Asn Ser His Ala Asp 
145 I 50 " 5 

Val Asn Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 



165 



Ala val Asn Asn Val Asp Ala Ala Val Val Leu Leu Lys Asn Gly Ala 



180 



uo*. rin Asn Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala 
Asn Lys Asp Met Gin Asn Asn i,y» 2QS 



195 200 



Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 



210 



215 



Ala Asn Arg Asp lie Thr Asp His Met Asp Arg Leu Pro Arg Asp lie 

225 230 

Ala Gin Glu Arg Met His His As P lie Val Arg Leu Leu Asp Glu Tyr 

245 Z3U 
Asn Leu Val Arg Ser Pro Gin Leu His Gly Thr Ala Leu Gly Gly Thr 



260 



Pro Thr Leu Ser Pro Thr Leu Cys Ser Pro Asn Gly Tyr Leu Gly Asn 



275 



280 



Ala Thr Gin Gly Lys Lys Ala Arg Lys Pro Ser Thr Lys 
290 295 
Gly Leu Ala Cys Ser Ser Lys Glu Ala Lys Asp Leu Lys Ala Arg Arg 

305 310 

Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu Leu Asp Ser Ser Ser Met 

teu Ser Pro Val Asp Ser Leu Glu Ser Pro His Gly Tyr Leu Ser Asp 

340 J ^ 
Val Ala Ser Pro Pro Leu Pro Ser Pro Phe Gin Gin Ser Pro Ser Met 
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„ U. « Hi. W. W> «| « » »•» «» S " 

370 ■* ,!> 

„. „ «. «. «. »» >•*= «• «■ "* » la Le " m ° ly SS 

385 390 



S „ Leu M» P»« .i. ™> « « HJ "* " "* «» **° 

405 



v.! »u ~ s.r ». « «- V.I Lg ser T„r »n .1, Thr .iy 

420 4 " 

„. t » Ph. <» «i «, u. ™ «• «« - « a G1 " °" 01 " 

435 

, * moi- val Pro Ser Gin Tyr Asn Pro 

Trp Leu Pro Arg Leu Gin Asn Gly Met Val Pro ^ 

450 45b 

Leu Arg Pro Gly Val Thr Pro Gly Thr Leu Ser Thr .In Ala Ala Gly 

465 470 

Le u cm Hi, Gly Met Met ser Pro He His Ser Ser Leu Ser Thr Asn 

Thr L eu ser Pro Z He Tyr Gin Gly Leu Pro Asn Thr Arg Leu Ala 

500 bu:> 
Thr Gin Pro His Leu Val Gin Thr Gin Gin Val Gin Pro Gin Asn Leu 

515 bZU 
Gln lie Gin Pro Gin Asn Leu Gin Pro Pro Ser Gin Pro His Leu Ser 
530 

val ser Ser Ala Ala Asn Gly His Leu Gly Arg Ser Phe Leu Ser Gly 
545 550 

G1 » Pro Ser Gin Ala Asp Val Gin Pro Leu Gly Pro Ser Ser Leu Pro 

Val His Thr lie Z Pro Gin Glu Ser Gin Ala Leu Pro Thr Ser Leu 

580 bab 
Pro ser Ser Met Val Pro Pro Met Thr Thr Thr Gin Phe Leu Thr Pro 
595 600 

cor Pro Val Asp Asn Thr Pro Ser 
Pro Ser Gin His Ser Tyr Ser Ser Ser Pro val ^P 

610 615 

.U «. Leu .» v.! £ «• BU « » « »> ~ «■ SS 

625 630 

ser Pro Asp Gin Trp Ser Ser Ser Ser Arg His Ser Asn lie Ser Asp 



645 650 

Trp Ser Glu Gly He Ser Ser Pro Pro Thr 
* 660 

(2) INFORMATION FOR SEQ ID NO: 18: 

li) SEQUENCE CHARACTERISTICS: 
K (A) LENGTH: 681 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: unknown 
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( ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Thr Pro Pro Cln Gly Olu Val Asp Ma Asp Cys Met Asp Val Asn Val 



, n n«n civ Phe Thr Pro Leu Met lie Ala Ser Cys Ser Gly 
Arg Gly Pro Asp Gly me xnr rj.u 3Q 

20 *° 
C i y Gly Leu Glu Thr Gly Asn Ser Glu ol« Glu olu Asp Ala Pro Ala 

Val lie 'sir A.p «» He Tyr Gin Gly Ala Ser Leu His Asn Gin Thr 

50 55 
Asp Arg Thr Gly Glu Thr Ala Leu His Leu Ala Ala Arg Tyr Ser Arg 

l Asp Ala Ala Lys Arg Leu Leu Glu Ala Ser Ala Asp Ala Asn lie 

Gin Asp Asn Met Gly Arg Thr Pro Leu His Ala Ala Val Ser Ala Asp 

100 1Ub 
Ala Gin Gly Val Phe Gin He Leu He A rg Asn Arg Ala Thr Asp Leu 

115 120 
Asp Ala Arg Met His Asp Gly Thr Thr Pro 



130 



Leu He Leu Ala Ala Arg 
140 



L eu Ala Val Glu Gly Met Leu Glu Asp Leu lie Asn Ser His Ala Asp 
145 150 

Val Asa Ala Val Asp Asp Leu Gly Lys Ser Ala Leu His Trp Ala Ala 

165 

„ i ie n ala Ala Val Val Leu Leu Lys Asn Gly Ala 
Ala Val Asn Asn Val Asp Ala Ala val vai u igQ 

180 18b 

» « lt1 n n Thr Pro Leu Phe Leu Ala 

Asn Lys Asp Met Gin. Asn Asn Arg Glu Glu Thr Pro ^ 

195 2UU 
Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys Val Leu Leu Asp His Phe 

Ala Z Arg Asp He Thr Asp His Met Asp Arg Leu Pro Arg Asp II. 

225 230 

Ala Gin Glu Arg Met His His Asp He Val Arg Leu Leu Asp Glu Tyr 

Asn Leu Val Arg Ser Pro Gin Leu His Gly Ala Pro Leu Gly Gly Thr 

260 

p„ T„r £. Mr Pro Pro ■» Cj; •« Pro «» *g «» »' 

M „ Ly . Z «y «1 .» Cly *• L,» « «, LjJ Pro - ~ W . 

290 29b 

.„„ r1n ai a tvs Asp Leu Lys Ala Arg Arg 
Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys Asp ^ Q 



305 



# 
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325 330 

L e„ s.r Pro v.! Tsp Ser - .1. «| « »" «' ffi ~ 

340 JH3 

vu », « « ~ - in s « " ph ' G1 " SS s " Pro s " 

« Pro « »» »i. «• ° 1Y *" W » BU ^ 

370 J/:> 

oly Bl . «. ».» v., ju m» »° «• K »* »" - Gly SS 

385 39 . T 

oly «. « g « jj. «- «- «« «» ss ™ - ser !S M 

P„ V.! »» M. «, ™ Ser T„r V.1 Leu «T - ~ -5 "V "> 

420 ^ 



Ma Leu M. P- T»r V.! «, oly s.r «. « - »- «» 

435 44U 

^ cov. riv Met Val Pro Asn Gin Tyr Asn 
Glu Trp Leu Ser Arg Leu Gin Ser Gly Met va ^ 

450 * b5> 

» «. «. «, ~ »> «• •» p " s; s * r Gln au ss 

465 1,0 , „ 

S.r L«u 01. Hi. 01, »« V.. «» Pro Leu Hi. S.r s.r Leu Jl. 

s« „. Leu s.r Tn ». t H.t s.r Jg «• "V ™ "5 "* 

500 5U 

« u. s «• «• «• « & «" ™ r ol " 01n Si ol ° pt ° G1 ° 

« Leu " ~ «» «- «• « « ° l " "° SS "* 

530 535 

01„ 01- ser L., «. Pro Pro « « Jg «» « » 1 ' - III 

545 550 



v,l S er ser U. »• - - « J& - K "» 

565 

Glu Pro Ser Gin Ala Asp Val Oln Pro L« Gly Pro Ser Ser Leu Ala 

580 btt:> 

v.1 Hi. Thr n. Leu Pro «. JJ. « ™ - S ^ S " 

595 bUU 
P„ ser s.r Leu vei Pro Pro v.1 «r U. U. «o Phe Leu * Pro 

610 615 
P„ ser 01. Hi. S .r Tvr s.r S.r Pro vel cl„ ... «. P» « JJ. 
625 630 , B 

M . Leu „1» v.1 Pro =lu Hi. Pro ». « «• ™ - «• 'JS S " 

645 

- c.r ser Pro His Ser Asn Val Ser Asp Trp 

Pro Asp Gin Trp Ser Ser Ser ser rro 6?0 



660 



ser Glu Gly Val Ser Ser Pro Pro Thr 

675 00 
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(2 ) INFORMATION FOR SEQ ID NO: 19: 

( i) SEQUENCE CHARACTERISTICS: 

( it) LENGTH: 2471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Rla Leu Arg Pro Ala Leu Leu Tr P Ala Leu Leu Ala Leu Trp 



Met Pro 
1 



wo cvs cvs >U « « - «• «• - «" C ' 6 " 9 5? " y ** 

20 Z3 



Glu Pro cys Val Asn Glu Gly Met Cys . ^ 

Gly T yr lye Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gin His 

50 bb „ , 



Val Thr Tyr His Asn Gly Thr 



M9 ,sp ™ cv= «. a- « «• oin ;r 01y Gly Tnr '** - 

65 

, r *i > Thr cvs Arg Cys Ala Ser Gly Phe 
Ala Gin Ala Met Leu Gly Lys Ala Thr cys Arg y 95 

85 

«, « r «. »P CV- «■ T„ se, «J « - l\l S " 

„ Pro CVS « » «, "V g CV. Hi. M «- ffi « »» - 

115 " u 



Tyr 



0 W Cys « cvs cm « «, ™. « «» SB «« ** 

L eu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr 

150 



Thr Asp Ala Cys 
145 



« ». » «. «- - <»• - as - Thr ni phe ~ 

165 



01 „ , y , ev. «. t« ».P « « «; «• « p " SS 

180 

Pro Gly Ser Tyr Gin Cys Gin 



Gin His Gly Gly Thr Cys Leu Asn Leu 
195 ^ 



205 



Cy 



Cy 
225 



Asp 



ri« Tvr cvs Asp Ser Leu Tyr Val Pro 
8 Pro Gin Gly The Thr Gly Gin Tyr cys a p ^ 

210 215 

. ,« ™ « » cv; «» « «v «, S « «■ * SI 

« « ,hs ... cvs M. cvs « 5 «» « «» «' SI "* 



245 



_ » en uig ara cvs Gin Asn Gly 

Cys Glu Arg Asn He Asp Asp cys Pro Asn His Arg ^ 

2 60 

cl , v.. cys «1 W .1, J- - V - «• S «" « « 

275 ^° 
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„. «, ~ «T -» » g| ~ «« - 53 " 

.„ Z » ». ;s « .» .» - a; Aa " M " - 

:: c - c - « - 3, - - - - % ~ 

„. » -p « - u. - - - <*• - - SB s " T " 

340 

Bho SGr CVB Met Cys Pro Glu Gly Lys 
Cys lie Asp Arg Val Ala ser Phe Ser Cys ^ 

Rla Gly Z ~ Cys H,s Leu -P MP - - Xj. ~ « « Cys 

His Z - - sb - Thr Asn pro a * sn Gly Gln Tyr K 

cln olv Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val 
cys Thr Cys Pro Gin Gly Tyr i* ^ 415 

J 405 

n G n Pro Cvs Glu His Ala Gly Lys 
A8 p Glu cys Ala Met Ala Asn Ser Asn Pro Cys 
420 

» hl> His cvs Glu cys Leu Lys Gly Tyr 
Cys Val Asn Thr Asp Gly Ala Phe Hi. Cys ^ 

AlaGly Pr 5 oAr g Cy S GluMetA SpI leA S nGluCysH, S Ser Asp Pro 

CyB Z Asn Asp Ala Thr Cys Leu Asp Lys lie Cly Gly Phe Thr Cys 

465 „, ««i His Cvs Glu Leu Glu lie Asn 

L eu cys Met Pro Gly Phe Lys Gly Val Hi. Cy m 

oin Ser Asn Pro Cys Val Asn Asn Gly Gin Cys Val Asp Lys 
Glu Cys Gin Ser ash nu i 5Q5 Slu 

, rv. Pro Pro Gly Phe Thr Gly Pro Val 
Val Asn Arg Phe Gin Cys Leu Cys Pro ^ 

515 

o - c^y Thr Pro Cys Leu Asn Gly 

c oi» n= »«p i" »» tl! C1 " So 

u. " c y . n. « 5S « - - - B! - 35 

- .» - «. .u .u - - « jr ~ 

W » -U -U «, - ffi « •» «« « 115 ~ 

cys c » n. «v - «v - - - ~ "* "' 

« «. " - « s c " L " «° oy " "* " 

MU VU « «, TV, ox; c y . *. c yS ... S "V - « SJ 
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01y ne cy . £ »p «, « a s « °" val s " "° 

.„ Ph. Thr X ... «, «• U. «. MP »■ "» SS "* S " 



675 

» t vb Glv Ala Thr Cys He Asn Gly Val Asn Gly Phe 
Asn Pro Cys Arg I>ys Gly Ala inr y ?00 

690 by:> 

«, c y . ,1. cy. Pro .;; «» « - »» ?3 ~ "* " S 

705 

wi u» u. cy. » « "»= 5S " iS *" C " ™ 

My ». s.r .1, Tyr „. cy. «. cy. up »» «r W «J «r .u 

J 740 

T « A 0n Glu Cvs Leu Ser Asn Pro Cys Gin Asn 
Asn Cys Glu Val Asp Lys Asn Glu Cys Le ^ 

755 /t)U 

01y c ly Thr cy. up » - »•» «T ir *g «" C " Ly ' 

770 //5> 



Lys oi» Phe Lys Oly Tyr Asn Cys Gin Val Asn He Asp Glu Cys Ala 
785 790 . 

Ser AS n Pro Cys Leu Asn oln Oly Thr Cys Phe Asp Asp He Ser Gly 

^ Thr cys His Z Val Leu Pro Tyr Thr Gly Lys Asn Cys Oln Thr 

y 820 B " 

« U. U. « ~ « J- « - ft ^ 

835 

~i c,. Tvr Thr Cys Leu Cys Ala Pro Gly 
Ly8 Glu Ser Pro Asn Phe Glu Ser Tyr Thr cy ^ 

850 855 

Trp Gin Gly Gin Arg Cys Thr He Asp He Asp Glu Cys Ue Ser Lys 

865 870 

p ro cy. mt «. »i. "V U. «. «- ug cm «, s.r g; « 



865 

pva Met Asn 

885 

cy. «. cy. X. «. «T - - JS »" W ^ flS "* 

J 900 yuD 

Asp UP cy. u. U. « « g- «• - «" K ^ "* 

0l y Z Thr » 3., CJ. U. « « "J " y 

930 yj3 



Ly . cy. cm Thr up » U. .U cy. Le» s.r «. Pro cy. ,y. u. 

945 

01 y 01 y Thr cy. ser Asp Tyr v.l u» -J Tyr * cy. Ly. cy. .U 
u. dy Ph. UP Z «i Hi. <p «J « » .u « SS «■ - 

980 

«. s.r ser Cy. Ph. ». «T «J « C V= V.l up ^.U U. ser 

995 1UUU 
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w t nv Php Thr Gly Ser Phe Cys Leu His 
Phe Ser Cys Leu Cys Pro Val Gly Phe 1020 

Glu „. « «. CS - - «• «*' iS," ™ 35. 

1025 

w - rvs Ser cys Pro Leu Gly Tyr Thr 

Val Asp Gly Leu Gly Thr Tyr Arg Cys Se^ y 1Q55 

104b 

oly Ly , « rJ ,n », ». V., » U. <*• ~ « 

1060 

„. „ w ?5 «iy - c y . v., p S «. «« «»" <*• 

cy . „ « «• "» sr "° Vil set 

1090 1UVD 

^ MP ,1. «. ». « « « sn.* 1 01u ~ as. 
r» iB « .., v.i ^ «. - »• His Tyc as. ,ln 

112b 

^ ~n cor- Tvr Cvs Glu Glu Gin Leu Asp Glu 
Cys Pro Leu ^Tyr Thr Gly Ser ^Cys n50 

~, ti- f riu Ala Thr Cys Ser Asp Phe He 
Cy 8 Ala Ser Asn Pro Cys Gin Hi* Gly Ala u65 
1155 libU 

„ i Ts^ rt riv Tvr Gin Gly Val Asn Cys 
Gly Gly Tyr Arg Cys Glu cys Val Pro Gly Tyr Gi^ 

1170 117i> 

Glu ^ «. Val -»«•<»• «' - ° l " JS.** *" "°° 

1185 liyU 



II. Asp Leu val Asn His Phe Lys Cys Ser Cys Pro Pretty 

1205 12 1U 

*r «, «» - - <*= «■ 01 " IS.'" ^ "5o W5 

1220 " 



ann Glv Glv cm cys Met Asp Arg He Gly Gly Tyr 
Pro His CysLeu Asn Gly Gly bin^ y 124& 



s „ c. «, c,. ,.u ™ =i, r »u =iv «• «» «v w 

1250 

n„ a cor ser Glu Gly Ser Leu Asp 
He Asn Glu Cys Leu Ser Asn Pro Cys Ser 8.^01 128Q 

1265 " /U , _ 

cys „. « n «. « « -» ^ - c " - ser 

ft, «, X. «• Of «" ~ IS,*"* " l ~ S5."" "° 

1300 ^ 

cy , M „ » «, «, ~ <*■ f 3 tr »» s - - ;s 5 pr ° Mp " T 

1315 1JZU 

n. .1. cy- n cv. « « «, «- - «» »; 0 "' «" ol " - 

1330 13Jb 

s „ c c ly «. v.> Lj. cy. «, «. *■ «- IS. 

1345 , e 
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01y c„ u. « « «. oy. ... «. .1, «y - <*■ "» 

* 1380 1JO:3 

K9 C, » Pr» TV, Ty, ~ cv. o «» CV. «• « «- - »» 

1395 x * uu 

« „ =y. «« - w J*- "° P " ~ I«o Pt ° P " 

1410 1415 

cy. ... « «. *yr »' « «• ftf " ^ "So 

1425 iaJU 

Ala cy. « S«Hi. «• «" S| 0 *» «» ° ly 3» S " 

_ «. "« «» iS, 1 " S " Pt ° 3!.*" **• 

„ „ Ty, Z\*« u. «. «■ - c " jb, 1 " val 01u 

1475 

« »h B Bin Cvs Gin Gly Asn Ser Lys Thr Cys Lys 
cys Leu Phe Asp Asn Phe B J« <*" oin y i5qo 

1490 1495 

t„ *» w ™ «■ «■« »" - Lys SffiT H1S *" M ° "°» 

1505 1!>AU , _ 

Gly cy S Asn Ser Gl^Glu Cy S Gly Trp AspGly Leu Asp Cys AUAla 

Olu ^Leu Ala Glu Gly Thr Leu Val He Valval Leu 



Asp Gin Pro Giu Asn i«u ™- — - * 

„.t „ «... «. L~ « «J »» »• «™ « »" "* 

L .„ 01, T« L.u L.u Hi. Thr M» w» S| 0 M9 SSP 

1570 lb ' b 
oly « M.t V.! WW. W *»« «, - "« - «$ 



1585 1550 



c 0 r t,eu Pro Gly Glu Gin Glu Gin 
Ly8 Lys Gin Arg Met Thr Arg Arg Ser Leu^Pro y ^ 

1605 



a , v.! «. oiy ~ «f v- Ue MP i3."" ** 

1620 

«! =.» >.P s.r »P HI. cy. m ... « «• -P 



1635 

Wu „. S« Hi. »» IX. «. "V * « So** ~ 

1650 1655 

S „ V.1 «! W .» {«„« « f 6 ?S TOr »80 
1665 1670 

„. «. v.i .j. « v-i .u ... « »■ «• « - ilSs Gly 

1685 

Tvfl ui ft Glv Ser Leu Trp Leu Pro 
Val lie Met Ala Lys Arg Lys Arg Lys His Gly ^ 
1700 

Ma ser Asn His Lys Arg Arg Glu 
Glu Gly Phe Thr Leu Arg Arg Asp Ala Ser As ^ 

1715 
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m V.L «, .U. W «• ~ - »" V " 

s „ »r«. «. - - bw ~ ^ " p vai -° 

«v « «■ "I «■ - »■ 

17ob 

L „ « „. «. »p « - •» «."« « - Trp 55."" "° 

„, „ u. »p - ^« ~ « - as.- - 
P „ m o" \» .i. «■ »• ~ - « i - as." 1 - v " 
Gly r« «• - - - is.~ - " 9 01y 
« - a - «• « «• a" G1 " " p s " 



Asn He He — ' 1865 

Thr Asp Arg Tnr Gly Olu Met ,1a Leu H.s ueu _ 

1875 

Arg Ala Asp ma - _ - 

1890 AD 



^ rw Ala Ser Leu Gin Ala Gin 
He Thr Asp Leu Val Tyr Gln^Gly Ala i870 

His Leu Ala Ala Arg Tyr Ser 
1885 

u. - «• 3,- *" " P "* 

u. «. -p - - •a.*" cy9 *" " u - " a v " "° " 5o 

r.r»u «. .» « - - - - i&r - v>i 55.- 

„ „ M . - - «' 55."" "° "* "* - 

Arg W U. r«...lT « - - .» - »« JS,^ «" "* 

MP wl r». v.. -p -p «» «■ °" \nr H " Ttp " a 
u. r« - « - - - - L -° Le ° w ' ss. 

1985 , rm Thr Pro Leu Phe Leu 

Ala Aen Arg Asp Met : Gin Asp Asn Lys GUClu 

, la Lvs He Leu Leu Asp His 
Ala Ala Arg Glumly Ser Tyr Olu ^ 

aso Arq Leu Pro Arg Asp 

»» «. ». «, w n. »• «p 0 «- «« »■» 5 20)5 

Ala ^ « « «S.»- " "* - " P 

Tyl - «. 5-™ - «» - s.- ihr " r "* 
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2090 2095 



Thr Pro Met Gly Lys Lys Ser Arc Jr, Pro Ser Ala Lys S.rThr Met 

2100 210b 
Pro Thr Ser^eu Pro Asn Leu jULy. Glu Ma Lys A-pAl. ^ Cly 

Ser Arc IrTl*. ^ ■« GlU ^ ^ VSo* ^ ^ 

2130 21Jb 

ser Val Thr Leu Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr^ 



2145 



2150 



Val Ser Asp Thr Thr Ser ser Pro Met lie Thr Ser Pro Gly XI. 1- 

2165 21/u 
01. Ala Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro ProAl. Pro 

2180 ■ 21bb 
Val His Ala Gin His Ala Leu Ser Phe Ser Asn Leu Hi. Ol« Met Gin 

2195 220U 
Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Serial Ser Gin Leu 

2210 22ib 
L eu ser His His His lie Val Ser Pro Gly 8«Gly Ser Ala Gly Ser^ 
2225 2230 

Leu ser Arg Leu His Pro Val Pro Val Pro Ala As P Trp Met A.nArg 



2245 



„et Glu Val Asn_Glu Thr Gin Tyr As^Glu Met Phe Gly gtVl Leu 



2260 



A1 a Pro Ala Glu Gly Thr His Pro Gly He Ala Pro Ginger Arc Pro 

2275 2280 
Pro Glu Gly Lys His He Thr Thr Pro Arc Glu Pr^Leu Pro Pro He 

2290 22yb 
val Thr Phe Gin Leu He Pro Lys Gly Ser HeMa Gin Pro Ala 01^ 
2305 2310 

Ma Pro Gin Pro Ginger Thr Cys Pro ProMa Val Ala Gly Pr^Leu 

Pro Thr Met Tyr Gin He Pro Glu Met Ala Aro Leu Pro Serial Ala 

2340 2345 
Phe Pro Thr Ala Met Met Pro Gin Gin Asp Gly Gin ValMa Gin Thr 

2355 2360 
XI. Leu Pro Ala Tyr His Pro Phe Pro Ala Ser V^Gly Lys Tyr Pro 

2370 2375 
Thr Pro Pro ser Gin His Ser Tyr Ala Ser Ser^Asn Ala Ala Glu a* 
2385 2Jyu 

Th r Pro Ser His 8« Oly His Leu Gin olyGXu His Pro Tyr Le^Thr 

Pro Ser Pro Glu Ser Pro As P Gin Trp Ser Ser Ser Ser Pro Hi. Ser 

2420 242b 
Ala Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr ProGly Cly Ala 

2435 244U 
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cly ow „ y «. «, «» « «» «» »" »" iS.' 1 ' v " "° Hi * 

2450 

Asn Asn Met Gin Val Tyr Ala 
2465 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
( (A) LENGTH: 2556 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
( X i) SEQUENCE DESCRIPTION : SEQ ID NO:20: 

Het Pro Pro Leu Leu Ala Pro Leu Leu cys Leu Ma Leu Leu Pro Ala 

L Ala Ala Arg Ly Pro Arc Cys Ser Gin Pro Gly Glu Thr Cys Leu 

20 ^ 



»,» c ly =iy w. cy= .i. «• »' »■ ™ ol " 8" *** ™ 

Gly 01, 1 » v.. .1, Pro «, c ys .1. MP Pro M. P,o M 

50 " 

.„ r , v Thr cvs His Val Val Asp Arg Arg 
Ser Thr Pro Cys Lys Asn Ala Gly Thr cys 8Q 



65 70 



Gly val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly Pro 



Leu Cy3 Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys Arg 

100 1U3 

. „ ».„ Te .u Leu Thr Leu Thr Glu Tyr Lys Cys Arg 
Asn Gly Gly Thr Cys Asp Leu Leu Tnr i, ^ 

115 

c ya Pro Pro =1, Trp ser ,w L y s s.r cy, .1. .U «• -P » 

130 135 
Ala ser Asn Pro Cys Ala Asn Gly Gly Gin Cys Leu Pro Phe Glu Ala 
145 150 

rvs His Cys Pro Pro Ser Phe His Gly Pro Thr Cys Arg 
Ser Tyr He Cys His <-ys * 17(J 175 

165 

r-i,, rin lvs Pro Arg Leu Cys Arg His Gly 
Gin Asp Val Asn Glu Cys Gly Gin Lys Pro a g 190 . 

180 io:> 

«, «. c,. »i. «. .1. «» ss -< - cys % cys 

tht ji. Z .1, Pro W *. .1. « « W S « C " ~ "° 

« « «. cm ».„ CIV "y «r *. «, »J ™r «, **P VI Jg 

225 2 

D ^ rw Phe Thr Gly Gin Asn Cys Glu Glu 
His Glu cys Ala Cys Leu Pro Gly Phe Tnr y 255 
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„ n. »p « g <*• - »» 5!S - - Mn ° lY ?» "* w ' 

Ml «» «, - - « - sis - - cy ' "° ns ol ° Ttp Thr 
Gly c — * ss - 

». T. «. is ~ «. ~ « - •» n« » 
Z « «. - - «v ™ ~ •» SS « - s « olu ffi n * 
MP w u. - - - <*■ 151 «" »' " a ~ w 

„ v. ju « ~ ~ C. jr. ,. - «• all ~ »' - 

Asn Pro Cys Asn Glu Gly 



355 



Le u Cys His Leu Asn Asp Ala Cys U. Ser 



370 

Ser Asn Cys Asp 
385 



380 



Thr Asn Pro Val Asn Gly Lys Ala He Cys Thr Cy, 



390 



„ « «, ~ «s «» - "* - ss ol " " p V>1 MP s cys 

s „ M „ «, «. « » o., .u - -v. c* n. - - 

420 

Gin Gly Tyr Thr Gly Pro Arg 



Leu 



Oly ser Phe Glu Cys Gin Cys Leu 



445 



435 



Cys Glu He Asp Val Asn Glu Cys 46Q 



Val Ser Asn Pro Cys Gin Asn Asp 



450 



s Met Cys Met Pro 
480 



465 



tip Glv Glu Phe Gin Cy 
Ala Thr Cys Leu Asp Gin lie Gly ei ^ 

fcsn Thr As 
490 

Leu Asp Lys lie Asn Glu Phe 



Gly Tyr Glu Gly Val His Cys Glu Val Asn Thr As P Glu Cys Ala Ser 



Ser Pro cys Leu His Asn Gly Arg cy. 



510 



Gin cys Glu cys Pro Thr Gly Phe 



Thr Gly His Leu Cys Gin Tyr Asp 



525 



515 



« w «. ^ «■ - ss - - - - ;a Ly ' " 
„ Z « - * b "* - va - 01 " *" IYt * 115 

2 «. «. «. - « - - ft « ~ ~ ~ S! Bi * 

56b 

*i rpu^ phe Thr Cys Leu Cys Arg 
Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr ^ 

• „ = ri« Thr Asn He Asn Glu Cys Ser 
Pro Gly Tyr Thr Gly His Hxs cy. Glu Thr ^ 



Ser Gin Pro Cy 
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610 



615 



620 



PCT/US93/09338 



M, s.. Mr Pro cj; »P »« ™* S? l " 



Xyr 
625 

Asn Leu Asp Asp Cys 



650 



, SP w . n. « «r - «- - Si *° - "° " y & ~ " y 

sar „« Z ~ - »• s «• «• - ew is " cy ' 

705 M „ p~ «i - «u a «. - ~ - ?S sl> 

725 



Asn Ser 



A5P Pro Gly Trp Ser Gly Thr Asn Cys A sp He 

o *«„ Pro cys val Asn Gly Gly Thr Cys 
Asn A sn R sn Glu cys cl« Ser M» Pro cy ^ 



Tvr Lys Cys Asp Cys 
A/ 740 



755 



ly . w « « ~ - »i - - - w 01y ph * s " 

01y « «. « - - - - « - s - - pt ° » 
::i » «. .» -J - - - - s - - - ~ a? - 
cys « «. ^ ~ ». ~ - «• - ~ a wa p - 

^ u. « - ~ ^ „ «; «, «• S «■ - «" 

" S ' , . »»1 cvs Pro Thr »1. Oly »« W 01y 

Mp Tyr »1. «r Fl» g« CyS 860 

:;: GlyMa ,rc r — -r—'- 

„. m. «v ~ «v - 18 Gl " ™ w - " P 

„ „ I - - » s ~ «• - a ol * n ' ls " 

l^-.^-^S.r.r^cv.-C.Pro,,^- 
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s „ „, ,„ «. c .1. « - 31 "° w *" ~ s " ~ 

cy . « « X -r cv. - ™ » 55."- -» "° 

L „ c. "1 » «. - - <*• v ' 1 val - 

1010 , rlv Thr Cys Gin Asp Gly 

c e r Arg Pro Cys Leu Leu Gly Gly Tnr iQ40 
Glu Cys Asp Ser Arg r*^ j 1035 

10 " 01y L .„ - *• » - - oly S.- 



Arg 



* ~ cot- ser pro cys Lys Asn Gly 
Val His Trp Cys Asp Ser Ser Pro y ^ 

I 10O3 

Ly . cv. » «. - «. ««• ~ - - iS."" "° ~ 



Cys Gin Asn Leu 
* 1060 



Gly Lys uys — 1080 

Gly „ Z .» «. tv, %ir - ~ - !&- cye 01u val 

u. r«. « «- a.- 1 »■> is." 0,s " l ° -° 

<*• v., ;g » ~ - - - - - S.~ 

Blr w « .« - - C -1. -J,- ™ -» «" So"' "° 

s „ ,„ c, s r» .» »• **r "» « as."" 01y T " 

1155 AA 
« «. C V.! U. - •* - S " "* 

„. CV- « «■« ~ - «■ ffl.'" "» W ' S. 

1185 



Asp Leu 



«. » - p. «. «• - a!."- " 5 cly ™ ly 

1205 

« , U cs «. n. ». v., *, » « ~ IS.- *" 

1220 

* en rlv Thr Cys Val Asp Gin 
cpr Pro Lys Cys Phe Asn Asn Gly Tnr uy 
Val Ser Arg Ser Fro uy j ^ Q mo 
1235 

vtl 01J 01y « c ««. - « «' a.- oly ° lu - 
c «!°«r « - - - - - ffl,- - - a 

r~ - « - *" v>i phe b " 0,9 ^ 



1285 



„ rm ser Val He Asn Gly Cys 
is Thr Gly Arg Arg Cys Glu Ser vai ^ 



Arg Ala Gly Hi« o *»- — ■» " - " i 3 05 

Lys Gly ^ 5 Pr0 CyS " 1320 
Thr Ala Arg Gly Phe He Cys Lys cys Pro 



Ly9 Asn Gly^Gly Thr Cys Ala Ser Asn 

Ala Gly Phe Glu Gly Ala 
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1330 1335 1340 

^ Thr Cvs Gly Ser Leu Arg Cys Leu Asn 

Thr cys Glu Asn Asp Thr CyS y a35 5 1360 

~ ce n r o„ » «, ~ » - «■ - 3!.~ 
01y ,„ « «• «■ «;»- M ° s " S.~ ^ 

L „ „, .„ 2 - T y , ««- «r «- =v. f ;» - - 

01o £ „ ^ « g; - «• ~ is."" M " ° ly "° 

« r«. »• - »y s " - oly 8*"' GlY " 9 -° 

1425 „, «i» rvs Glu Leu Pro Glu Cys Gin 

He Pro Pro Pro Leu He Glu Glu Ala Cy^Glu 1455 

* Ala G1V Asn Lys Val Cys s e r Leu Gin Cys Asn Asn His Ala 

Glu Asp Ala Gly Asn ±y xnt\j 
1460 

ws 01y *, ».p «, «» «p 55,- - « - is.- pt ° Irp 

1475 

Lys „ cv, The «. s« ffi .U <*• «, «• tg,"- - "» 

1490 

H1 . cy . w ~ «. cv ; - - «■ •» ffi.- Ph * MP 55» 

1505 1 

«, C* .U « «• "V - «. - -P «• 

tys w - « .» »;,«*• "P «» ffl." s " 

»U „. Cys lu ^ »P .IT - "P »' «• SS," 1 "° "" 

1555 1S0U 

_ „„. Val val Val Leu Met Pro Pro Glu 
Arg Leu Ala Ala Gly Thr Leu Val Val Val v ^ 
1570 

01 „ ^ „ » s« ~ »- - - - ®r Mu s,t " 9 s« 

1585 lbyU 



* 6e « Ma His Gly Gin Gin Met 
L eu His Thr Asn Val Val Phe Lys Arg As^Ala 1615 

loOb 

„. ^ * «x «, «sr - « Lys ,u 

1620 iDZ 

, *i a Pro asp Ala Leu Leu Gly Gin 
Lys Arg Ala Ala Glu Gly Trp Al^Ala Pro Asp ^ 
1635 

^ m. - - - «' s " 01 " & a,r - K5 *" 

„ r~ «p »- «-» »» = iy !s. ,i< vai 1,1 mu 

1665 

Ito MP « «. «. <*. « «■ »■ ?i?„ s « cy - phe 

168b 



4fc PCT/US93/09338 
WO 94/07474 

„. « - «- « - ~ »»" T ~ 

L .„ >.„ n. » «. - - - ~ s;.- olu 
P „ .„ r» - - ss - - ** JS."' " a 

^ « u. - - «• - - «' TS»- ~ ~ *" »» 

« « - - *ir ~ - Ttp - 01y Phe - * l 

rlu Glu Leu Gly Glu Asp Ser 
Ser Glu Ala Ser Lys Lys Lys Arg Arg^lu 

17 80 

<5er Asp Gly Ala Leu Met Asp 
Val Gly L eu Lys Pro Leu Ly S Ser Asp ^ 

1795 

„ » «. « «. « vnr ™ - - L,s Ly ° phe 

M r= lu =>. - - « « sb.* s5p oin ™ -° 

1825 , T aanAla A ia Asp Leu Arg Met 

Hi 9 Arg Gin Trp Thr^ln Gin His Leu As^Ala 

rlnrlv Glu Val Asp Ala Asp Cys 
ser Ala Met Ala Pro Thr Pro Pro olntty ^ 
i860 

, . ri« Pro Asp Gly Phe Thr Pro Leu Met lie 
Met Asp val Asn Val Arg Gly Pro^sp y 1885 

1875 . „_ 

i«. Glu Thr Gly Asn Ser Glu Glu Glu 
A la ser cys Ser Gly Gly Gly Leu Glu 1900 
1890 

«. « - - »* s « ~ phs si. 1 * Gln 01y " a -° 

" 5 ,u. « «. -J - - * ^ 'JRT M " L " 

xu „ g w s „ ~ -p «. - »■ " 5 MU - " ~ 

Met Glv Arg Thr Pro Leu His Ala 
AU Asp Ala Asn He Gin Asp Asn^Met Gly 1965 
1955 

* Ala 6ln Gly Val Phe Gin He Leu He Arg Asn 
Ala Val Ser Ala Asp Ala Gln^iy WQ 

1970 . Hie Asc Gly Thr Thr Pro Leu 

Arg Ala Thr Asp Leu Asp^Ala Arg Met H.s As^ 

1985 , „ i ri„ Glv Met Leu Glu Asp Leu He 

U. Leu Ala Ala Arg^Leu Ala Val Glu Gly^Me 2Q15 

His Ala Asp Val Asn Ala vg MP ^ « Gly Lys^Ser Ala 

Val Asp Ala Ala Val Val Leu 
2045 

Me t Gin Asn Asn Arg Glu Glu Thr 



Asn Ser His 2025 

L eu His Trp Ala Ala Ala Val As^Asn .... . - ^ 
2035 



Leu 



Ly s Asn Gly Ala Asn Lys Asp 



PCT/US93/09338 



94/07474 



-134- 



Pro 
2065 



2050 

Leu Phe Leu 



2055 
Ala Ala Arg 



2060 



Glu Gly Ser Tyr 



Glu Thr Ala Lye Val 



Leu 



Leu Asp His Phe 



2070 
Ala Asn Arg Asp 



2075 



2085 



He Thr Asp His 
2090 



2080 



Met Asp Arg 
2095 



Leu Pro Arg 



Aap lie Ala Gin Glu «gi 



Xu Arg Met His His ASP Il-V* ^ 



2100 



Leu 



Leu Asp Glu Tyr 



Asn Leu val Arg 



ser Pro 



Gin Leu His Gly Ala 



2115 



2120 



2125 



Pro Leu 



Gly Gly Thr Pro 



Thr Leu Ser Pro Pro 



Leu cys Ser Pro Asn 



2130 



2135 



2140 



^ ty . ™ siy »i •» - «• vsl 5!!o 



Gly Tyr Leu Gly Ser 



2145 
Lys 



2150 



2155 



Pro Ser 



Ser Lys Gly Leu 



q P r lvs Glu Ala Lys Asp 
Ala cys Gly Ser bye n75 



2165 



2170 



Leu 



Lys Ala Arg Arg 



Lys Lys Ser GlnW 



2180 



Asp 



Ser Ser 



Gly Met Leu Ser Pro 



2195 



2200 



Gly Tyr Leu 
2210 



Ser Asp Val Ala Ser 



Gly Lys Gly Cys^eu Leu 

Glu Ser Pro His 
2205 

Pro Ser Pro Phe 



Val Asp Ser Leu 



Pro Pro Leu Leu 



2215 
S er Val Pro Leu 



2220 



Gin Gin ser Pro ser v«. 



Asn His Leu Pro Gly Met Pro AsP Q 



2235 



2225 
Thr His 



Leu Gly lie Gly 
2245 



His Leu Asn Val Ala 



Ala Lys Pro Glu Met 



2250 



2255 



Ala 



Ala Leu Gly Gly Gly 



Gly Arg Leu Ala Phe 



2260 



2265 



Arg Leu Ser His Leu Pro 



Val 



Ala Ser Gly Thr Ser 



Ser 



2275 

ser Ser Gly 
2290 



2280 



Glu Thr Gly Pro Pro 
2270 

Thr Val Leu Gly 
2285 



Gly Ala Leu Asn Phe Thr Val ^ 



Val Gly Gly Ser Thr Ser 



2295 



Gin Cys Glu Trp Leu 



2310 



Leu Asn Gly 
2305 

Pro Asn Gin Tyr As^Pro 



Ser Arg Leu Gin 



Ser Gly Met Val 



2315 



2320 



Leu Arg Gly Ser 



Val Ala Pro Gly Pro Leu 



Ser Thr Gin 



Ala Pro Ser Leu 
2340 



2330 
Gin His Gly Met 



2335 

Val Gly Pro Leu His 



Ser 



Ser Leu Ala Ala Ser Ala Lw 



2345 

Ser Gin Met Met 



2355 



« cor Thr Arg Leu Ala Thr 
Leu Pro Ser Tnr 2375 



Gin Pro His Leu 



2350 

Ser Tyr Gin Gly 
2365 

Val Gin Thr Gin 



2370 



Val Gin Pro Gin Asn 



Gin 
2385 

Ala Asn 



Leu Gin Met Gin 



2390 



He Gin Gin Gin Gin Ser Leu 



2380 

Gln Gin Asn Leu Gin Pro 
2395 

Pro Pro Pro Pro 
2415 



Leu Gin Pro Pro 



2405 
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Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His guOly Arg 



2420 



Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gl^Pro Leu Gly 

2435 244U 



Pro Ser Ser Leu Ala Val Hi^Thr He Leu Pro olnQlu Ser Pro Ala 

Pro \ 

24 65 



2450" 2455 
Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala Al^ 



Gin Phe Leu Thr PjoPro Ser Gin His SjrTyr Ser Ser Pro ValQlu 
Asn Thr Pro grHl. Gin Leu Gin Valero Glu His Pro Ph^Leu Thr 
Pro Ser Pro_Glu Ser Pro Asp OlnTrp Ser Ser Ser Se^Pro His Ser 



2515 



Asn V. L 8.r Asp Trp Ser Glutty Val Ser Ser ProPro Thr Ser Met 

J ro Glu Ala 

2545 2550 2555 



2530 

Gin Ser Gin lie Ala Arg He Pro Glu Ala Phe Lys 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 7419 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

„ 0 ccc =cc c;; -c « « 2 «. » S S = S 



s q s s s s s s s ss s s ss si s; s 
s s s; s £ ss s - s s i ?s s s ss ss. 

30 35 

S £ S £ 52 Si - 5 - i - = ~ ™ ? i " 

50 " 

SSS 5 S 5 S S !S S 5 « = 3 - s 
5SESSSS5SS5SS52E 



48 



96 



144 



192 



240 



288 
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85 

80 _ _« „ rT CAT CCA TGC TTT 

saaaaaaaaaaa-. — - 

95 pit ATG CTC AGC CGG 

52 = S S S § = « S S S " ~ " 3 

110 ^ - TT ACA G GT AAG GAG TGC 

s s a a a s 5 a s i = - •* - » - 

- - ■? S S a = = = " 5 s S S = B 

Gin Trp Thr M| Ala cya 1$0 

-sssaaaaaasaaaaa 

160 tag TGT GAC ATT CCA GGA 

s a a a a = s 5 s s s « ~ m P " 

175 „ p rTr cCT GGT TCC TAC CAG 

s a a s a I a a = a a - - - - a 

190 „„n rAG TAC TGT GAC AGC CTG TAT 

5! a a a a a a a a a - « - - - 

210 m rrc ACC TGT CGG CAG 

sssSsss 5 =a=s S sss 

B S=SBB 5 SS5 = S SBBa! 

255 ~ AAC ACT TAC AAC TGC CGC TGT 

nnr TTT TGT GTG GAT GGG GTC AAC ACT T _ ^ c 

£„ fly SJ SI val **P <*Y val Asn Thr Ty 

270 - ~T>n r.aT GAA TGC 



90 



Asn Giy * 2 75 

270 * M rar GAT GTG GAT GAA TGC 

s s a a a a a a a ~ s - ~ - » 018 

290 ^ rrr ACC TGT GCC AAC CGC 

= = a a a a a a a a a - - » - - 

305 „^ TPG AGT GGA GAT GAC 

320 m „,p CCA GGC 



Asn Giy ^J-y " * 325 

H « «. «* «* »* occ 5° s s a a a a 

g c , K !S E c « S »•» «j <*• »» * SS 



336 
384 
432 
480 
528 
576 
624 
672 
720 
768 
816 
864 
912 
960 
1008 
1056 
1104 
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Ser Thr Cys IXe Asp Krg V.l Ala Ser Phe ser 
350 

r. rr» CGT CTC CTG TGT CAT CTG GAT GAT 
GGG AAG GCA GOT CTC cio ftsp 
Gly Lys Ala Gly Leu Leu Cys Hxs ^ 

S S S K " S S 5 i S E 

385 

430 4Jb 

PCT CGT TGT GAG ATG GAC ATC 
GOT TAT GCA GGA CCT CGT 1 ^ ^ Ue 

Gly Tyr Ala Gly Pro Arg v.y 45& 

450 

GAC CCC TGC CM f T GAT OCT ACC TGT CTG OAT 
A8 p Pro Cys Gin Asn Asp Ala Thr ^ 

46b 
495 

510 

545 

560 

575 



Cys Met 

GCA TGC 
Ala Cys 

CCC CTA 
pro Leu 



Cys Pro Glu 
1 365 

ATC AGC AAT 
He ser Asn 
380 

AAT GGG CAA 

Asn Gly Gin 
395 



GCT GAC 
Ala Asp 
410 

CCT TGT 
Pro Cys 
425 



TGC ACA GAA 
Cys Thr Glu 

GAG CAT GCA 
Glu His Ala 



TGT GAG 
Cys Glu 

AAT GAG 
Asn Glu 



TGT CTG AAG 
Cys Leu Lys 
445 

TGC CAT TCA 
Cys His Ser 
460 



AAG ATT 
Lys He 



CAT TGT 
His Cys 
490 



GGA GGC TTC 
Gly Gly Phe 
475 

GAA TTA GAA 
GlU Leu GlU 



AAT GGG 
Asn Gly 
505 



CAG TGT GTG 
Gin cys Val 



CCT GGT 

Pro Gly 

AGT ACT 
Ser Thr 



TAT GAA 
Tyr GlU 

AAC ATT 
Asn He 
570 



TTC ACT GGG 
Phe Thr Gly 
525 

CCG TGT CTG 
pro Cys Leu 
540 

TGC CAG TGT 
Cys Gin Cys 
555 

GAC AAC TGT 
Asp Asn Cys 



590 

m o** tpT TAC AGC AGC CCT TGC CTG 
CAG ATT GAT GAA TGT TAC AG ^ cyg ^ 

Gin He Asp Glu Cys Tyr aec ^ 



GAT GGT 
Asp Gly 
585 

GCC ATC 
Ala He 



ATT GAT TCC 
He Asp Ser 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



AAC GAT 
Asn Asp 



TGC AGT GAC 
Cys Ser Asp 
605 

GGT CGC TGC 

Gly Arg Cys 
620 
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s ~ a a a a a a I - = s s 3 s s 
5 a a a a s s s a a a s g s s = 

aaaaaaaaaaaaaaaa, 

WSSS5S5SS53SSSS 

saaaaaaaaaaaaaas 
aaaaaaaasBaaaaaa 

720 

S S S S S 3 I B S 5 5 5 S 5 S B 

aasBaaBBBaBBaBBB, 

a s a a s a a s s a a a a 5 i s 
s a a s a a s s i " s s s » s 

785 

Trr TTT GAT GAC ATA 

5S S S B s a = i a s s a - - - - 

BBBBBBaaBBBBBBSB 

a a a a s a si = a a a a a a a a 

a a a a = s = = = S " s s = » s 
a a a a a a a a a a a a a a a a 

865 

aaBBBBaaBBBBBBBB 

880 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



WO 94/07474 



PCI7US93/09338 



-139- 



s s a a a a 5 a as s a i a s a a 

s s a a 5 = s = s " a s s; s = » 
Ssssssassasaasas 

s a a a a » a » s a s s ° = s a " s 
s s s a s s = HI a a K = a s s * 

960 9 

sssaaaaaasaassss 
5 s a a a a a a a a a a a a a s 

990 

- is a a a p a a a ss s a a a a 
ssssssssss'ssissss 

1025 

„ tgt «. « « «. «* « ~ 2; ss a a a a a 

Thr cys Val Asp Gly Leu Gly ^Tyr r g 1Q50 
1040 

«, »ct o 00 «, « » « - - a a a a a a i« 

Tyr Thr Gly LyB Asn Cys Gln^inr n 1Q65 

- »'« * » « g » 5n s a a s a a a 

Pro Cys Lys Asn Lys Gly tw uy* 1Q80 i 0B5 > 

1070 lu/o 

s a a a a a a a a as a a a a a 
a a a a s a a a a a a a s as a 

1105 

a a a a s a a a a s s a as s s 

nnr nrr TAC TGT GAG GAG CAA CTC 

s a a a a a ss a a a ^ - - - 

1135 114U 

GAT M TGT =C= JCC -J « £ « « « » £ £ S £ 

Asp Glu Cys Ala Ser AM Pro Cys ein ufi0 U65 

1150 1155 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



3360 



3408 



3456 



3504 
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s s a s a a - a s s. s s s s S. 55 

ssa'asss.aaaassaaa 
ssassassssaaasss. 
£saaajsaassssaaaa 

^rr TCT GAG GGC AGC 

s - s a a s s s ss a a *~ a - - 

1265 n(pr tpT GTT TGC CGT AGT 

sss s assess = 5 

s a s a s a aa a a a as a a a 

l295 TOC cro ** o« «. «* s* SS S 32 »» «" "° 

w 55 S° SJ cicely tm w «. »;J 0 »» 

1310 ttt TCC GGG GCA AGG TGC 

s a a a a a a a a aa *~ - - »~ 
ssssssas^sssasa 

- ? a as a a aa a a a a a a a 

His Thr Ala ser 13&5 

M T«t occ mt « « « « ss a a a a a 

1375 „^tv nr* TTC 



Giu 1380 

s a a a a as a a a aa a a a a 

1390 rrc AGC ACC CCT CCT 

s a a a a a a a a ss a s- - a -. 
» - !- s a a a a ss s a s a.s a 

Ala Thr Cys ser J 1430 

1425 



3552 
3600 
3648 
3696 
3744 
3792 
3840 
3888 
3936 
3984 
4032 
4080 
4128 
4176 
4224 
4272 
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«, nnn <vcc CAG TGG GAT GGG GGT GAC 

- s ss s 5 s s sgffi a. - »f - - 

1440 „ _ » a TrP TCC TC C CCA CTT 

S S 5 S B S SS 5 S 5 5J- - - - 
55^555 555558 5 555 

?SSSS,5SSS£SSSS£S 

~ - V? SS IS 5 Si 5 S! - 5 SS 5- ~ 

Cys Lys Tyr Asp Lys Tyr y 151Q is»aD 

s s s; 5 s = s ss s 5 s £5 « s 

- S 5S S S 55 S S 5 5 5 5 5 5 
5 SS 5 5 SS 5 5 S 5,S 5 5 5 5 
55 5 S 5 5 5 SS 5 55 5 5 5 5.5 

rrr TAT TAT GGT GAG AAG TCA GCT 

S 5 S 5 5 5 !5 5 53 3!.~ "* 

1585 

_ Trc cTT CCT GGT GAA CAA 

gct «. ». » a; « s 5 5 5 s S ™ w «. «» 

Ala net Lys Lys Gin Arg n 1610 

mr%fn ... rTC TTT CTG GAA ATT GAC AAC CGC 

Si SS S 515 5 5 55 5 5. - - »• - « « 

1615 

SS5 S55.SS5 5555 = 53 
£= 5555 = 55 SS 555 55. 

S S 5 5 S 5 5 5 55 5 S 5 SS 5 
-555SSSS5 5S55S55 

555*5555,555555555 

1695 



4368 



4416 



4464 



4512 



4560 



4608 



4656 



4704 



4752 



4800 



4848 



4896 



4944 



4992 



5040 



5088 
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5184 



- S S S S 5 S S S = = K » £. 

1710 1715 

S S £ a i 0 "° - S a S " S.™ 

ss a s ass s s ss a a a a ss a 
515 S 5 ss a a sss s: a a ss 5 a 

1760 

s a ss ss a a 5,= * s 5 a 5 s s ' s 

1775 1 

CM CAG CAC CTT « O f GCA « «. « £ £ S SS £ 

Gin Gin His Leu Glu Ala Aia ^ P i8Q0 1805 

1790 17 

»~ r**r rrr GAT GTG TTA GAT GTG AAT 

S £ S S £ S - £ £ S!S - 

0IC CGT =C CCA « -C XCC ACC CCA TTG ATG TTG £ = « •» 

Val Arg Gly Pro Asp Gly Cys Thr pro 1635 

1825 AO 
GGA GCC AGC TCA CAT TTG ACT GAT CAA GAT GAA GAT CCA GAG GAC TCT 
Gly Gly Ser Ser Asp Leu Ser Asp giu h t» ig5Q 
1840 10,3 

S E S £ S S »S ?S SS SS £ £ £ SS £ 

1855 1860 

s s a ss a s s a s s ss = s s a 

1870 1875 

S s a s sjs s a a ss a a ss s ss 
s a s a ss a s gs a a ss a 

1905 

a s a a a a a ss s a s ga a a 

1920 x *** 
ACT GAT CTA GAT GCC AGG ATC AAT GAT GGT ACT £ CCC CTG ATC CTG 
Thr Asp Leu Asp Ala Arg Met Asn Asp y i945 
1935 

s a a ss s a a a s a ss ss a a £ 

1950 1955 

a a a k a a a a a ss a a a as 



5280 



5328 



5376 



5520 

5568 

5616 

5664 

5712 

5760 

5808 

5856 

5904 

5952 
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5 s s s s s = s s s « = = S= - 

S5SSS5SS5S5SSSSS 

S ES S S 3 3 S £ 3 35 S 5 3 £ 
5SSSSS3SSSSSS3SS 

S S 5 S 5 S S S S5 S S S 25 S 
SSSSSSSS.SSS5S5 5S 

ssssssgsssssssss 

s s s s s S S S 5 S SS s s s s 

SSS5S.SSSSSSSSSS.S 
SSSS5S5SSSS5S5SS 

2 X 4 5 

- f S S 5 S S 55 5 5 S 5 5 5 S 

Thr Tyr Val ser ^t- 2 ^ 65 

S S SB 5 3 55 5 3 S 55 3 5 5 
S5SS5SSSSSS555SS 

2190 * -mn rTT CCC TCA GTG AGC 

5 5 5 5 SS 3 3 3 3.3 3 ~ - 

- ~ » S ffi 3 3 3 SS S 5 3 |r K 

Gin Leu Leu S «£ 5 M1S 2230 

« » « 3 S 3 5 3 3 3 3 S3 3 3 

Gly Ser Leu Ser Arg wsu 
2240 



6000 



6048 



6096 



6144 



6192 



6240 



6288 



6336 



6384 



6432 



6480 



6528 



6576 



6624 



6672 



6720 



6768 
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s s a s a s ss a a s as s a = 
s = s s s as s s a |l- = s s " a. 
as a s a a a a a a a s a a a a 
ssassssssssssass 

s a a a a a a ss a a a ss s s 

2320 

a a a a a a as a a a as a a a 
a as s a as s a a ss a a a s 
s a s a a a a a a ss a a a g,s 
s s a s a s s a as a a s ss s 

2385 

n n PTC CAG GGT GAG CAT CCC TAC 

s a s a s s s I s - s s. «, «.,«• » - 

2400 

ssaasssssassssss 
s s s s a s a s a s a a a s s & 
ss a a s a s a s as a a a a a 

1 2450 

»«. PAG GTT TAT GCG TGAGAGAGTC CACCTCCAGT 
CCA CAC AAC AAC ATG CAG GTT TAT ^ 

Pro His Asn Asn Met uxn v j 247Q 
2465 



6816 



6864 



6912 



6960 



7008 



7056 



7104 



7152 



7200 



7248 



7296 



7344 



7392 



7439 



24b3 

„ M — — — ^ 

M „ — - 7^ :™ 

..ru-UOm ATTCGTCAGT TTCACTGGGT ATCTGCAAGG CTTRTTGRTT 
„ 0 „ MI == A— » ^ ^ TTGGCCCAT 
ATTCTAATCT AATAAGACAA GTTTGTGG rcxTTCTTGC 
GT.TACTCTC TTCTATTTGG AOAATAAGAT GOATGCTTAT TGAAGCCCAG ACXTTCTT 

GCATTTTRAG CCCTGCAGGC TTCTGCCATA —AA «™ 



7499 

7559 

7619 

7679 

7739 

7799 
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* -PPrrrcTGA ATTGACCTAC GCATCTCCTC 
^rrriATTA TGCCCTGGAA TTCTGCCTGA 

»-™t cTTOtoctI TOMC1IIC T gg==cr TC a . 

" — ~ — — ; 

^C^CC TTTCRAGTAT GTTGTTTCTT »»« ' 

" "c — — — " ' 

o— o atcccttcar 

rr: — ™« — 
™ OT . — °™ 

HI*: »»— CCAGAARARG G^GGRAXG T~C OA— 

—° ~ cc cmmcci i 

C— , AA~ RRCTGTTGAG 

nlcG — ™ ««« 
M,C * CAGATGATCA GTCTCTACTG ATTATCTTGC 

CGGACCCAGT CACACTTGTC TTCCGTTTGT C» „™gTRTA 
«^M-C iBBnm TCRCRCCGTG TGGTCCGTGT TACTGGTRTA 

TP r TTAAAGG CCTGCTCACC AATCi i av^a a 

™ , — — " ~: 

TGGRCTTGTT TTCTRTCATC CAAA.CAGCC CTATAACAAG GTTGGRARRG GAGGRACTAT 
TGGACTTGTT „,„„,,„,„ TTCTTTTCCT CTGAAGCGCC CATGACATTC 

r^GCRGCC TTCCTATTTT CTOCT - ^ 
CCTTTGGGM CTAACGTAGA AACTCRACAG AACATTTTCC TT 

GRCRACTATA «G TO .GAC ^GCCCC TCRCCTO^ 

ru» — ^ ™ := 

ITT A0T,«T TGTAGATAAR GAATAGTTTT CTTCCTCTTC TCCTTGGGCC 
X— — AGTGCXG.gr RCCTGC»» 

^* CTGGGCATTT TGTRGATATT ARCRGGTGRR ««« 
^ CTCATTCCTT CTATGGCTGC RAGTATGCAT CAG^ = 
ATXTGTCTGT CGGTGGCCCC ATATGGAAAC CCTGCGTGTC TGTTGGCATA ATACTTTNCA 
ZZL —A, CCARATTTRT ~CR RARRTARTTA 

T H g,™rg OT g^tc tgc™™ TCXGGRTGTG GCAACATTCT 

rr: — — — zz. 

nonTATCTGC ATGGACCCTC ACCATCtll-i 
GGACCCATTT ATTATTCACA GATGGGGAGA ACCTATCTGC 

"loACR CRCRGTGCRG GGRGCCAGTG GCGATGGCGA ^ CCCCTGGGAA 
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\Yj £ AT IS CLAIM EDJS: 

, A pham.aceu.ical composition comprising a therapeutically 
etfecti « of a Notch protein; and a P— ca„» accept earner. 

5 v„« of rlaim 1 in which the Notch protein is a 

2. The composition ot claim 1 m 

human Notch protein. 

3 A pharmaceutical composition comprising a th.rapcu.ioH, 

encode* by the DNA sequence depicted . Ftgure 8A (SEQ D N ' 
, D NO*. 8C (SEQ ID WW). 9A (SEQ ID NO*, or 9B (SEQ ID N0.9), 
which is able .0 be bound by an antibody ,o a Notch protein; and a 
pharmaceutic* acceptable carrier. 

15 4 A pharmaceutical composition comprising a therapeutically 

eff ecti,e amount o, a protein, said protein comprising a Notch „ino acid 
seq „ence depicted in Ftgure 8A (SEQ ID N0:5>, SB (SEQ ID NO. >. SC (SEQ 
Zo:V, I (SEQ ID N0: 8 ). or 9B (SEQ .D N0:9>, which dtsplays one or 

pharmaceutically acceptable carrier. 

5 a pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a fragment of a human 
et t . ollu n{ the extracellular domain of the protein; and 

25 Notch protein consisting essent.ally of the extracenu 

a pharmaceutically acceptable carrier. 

6 A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising a region of a Notch protein 
30 clling *e HOP homologous repeats of the protein; - a pharmaceut.cally 
acceptable carrier. 
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7 A phamaceutol composition comprising a therapeutically 
effective amount of a fragment of a No.* protein lacking a ponton of the EOF- 
hlogous repeats of the protein, which fragment is able to be bound by an 
antibody to a Notch protein; and a pharmaceutical* acceptable earner. 

8 A pharmaceutical composition comprising a merapeuticall, 
effective amount of a protein, said protein comprising a hanoionally active 
portion of a Notch protein; and a pharmaceutical,, acceptable earner. 

9. Tne composition of claim 8 in which the Notch protein is a 
human Notch protein. 

,0 A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein 
functional,, active portion o, a human Notch protein Joined v,a a p*p.,de bond 0 
?: q „ence 0, a protein different from the Notch protein; and a pharmaceutic.,, 
acceptable carrier. 

, , the composition of claim 10 in which the functionally active 
, ponion of me Notch protein is encoded by the human cDNA sequence contained 
„ plasmid hN3 k as deposited wi* the ATCC and assigned acces- umb 
68609 or encoded b, the human cDNA sequence contained . plasmid hN* as 
deposited with the ATCC and assigned accession number 68611. 

, 5 12 A pharmaceutical composition comprising a merapeutically 

' effective amount of a protein, said protein comprising the amino acid sequence 
depioed in Figure ,0 (SEQ ID NO;.,); and a pharmaceuttcall, accept 



carrier. 



30 



13 A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising One amino acid sequence 
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depicted in Figure 11 (SEQ ID N0:13); and a pharmaceutically acceptable 
carrier. 

. 14 A pharmaceutical composition comprising a therapeutically 
effective amount of a protein, said protein comprising the portion of a human 
Notch protein with the greatest homology to the epiderma. growth factor-hlce 
repeats 11 and 12 of the Drosophila Notch sequence as shown in Figure 4 (SEQ 
ID NO: 14); and a pharmaceutically acceptable carrier. 



10 



15 



15 A pharmaceutical composition comprising a therapeutically 
effective amount of a derivative or analog of a Notch protein, which derivative or 
analog is characterized by the ability ft. *». when expressed on the surface of a 
first cell, to bind to a Delta protein expressed on the surface of a second ceU; and 
a pharmaceutically acceptable carrier. 

16 A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein comprising a Notch 
protein joined via a peptide bond to a protein sequence of a protein different from 
the Notch protein; and a pharmaceutically acceptable carrier. 

17. A pharmaceutical composition comprising a therapeutically 
effective amount of a fragment of a Notch protein, which fragment is 
characterized by the ability ft. vfrro, when expressed on the surface of a f.rst cel.. 
to bind to a Delta protein expressed on the surface of a second cell; and a 
25 pharmaceutically acceptable carrier. 

18. A pharmaceutical composition comprising a therapeutically 
effective amount of a chimeric protein, said chimeric protein comprising a 
fragment of a Notch protein joined via a peptide bond to a protein sequence of a 
30 protein different from the Notch protein, which fragment is characterized by the 
ability ft. vitro, when expressed on the surface of a f.rst cel., to bind to a Delta 
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protein expressed on the surface of a second cell; and . pharmaceutical,, 

acceptable carrier. 

19 A pharmaceutical composition comprising a therapeutically 
5 effective amount of a protein, said protein comprising a derivative or ana,og of a 
D*. protein, which derivative o, ana,og is character^ by the abiiHy . «. . 

on the surface of a second cell; and a pharmaceutical!, accept** earner. 
,„ 20 A pharmaceutical composition comprising a therapeutically 

effective amount of a chimeric protein, said chimeric protein comprising a 
fragment of a Del. protein jomed via a peptide bond to a protein s^uence of a 
protein different from the Del, protein, which fragment is character,^ by *e 
ahiiity „ when expressed on me surface of a ft. cell, to b,nd to- a Notch 
!5 protein expressed on the surface of a second cell; and a pharmaceutical!, 
acceptable carrier. 

21 A pharmaceutical composition comprising a therapeutically 
effective amount of aW*, said protein comprising a derivative or analog of a 
20 serrate protein, which derivative or analog is characterized by the ability ,n mro 
when expressed on the surface of a first celi, to bind to a Notch protem expressed 
on the surface of a second cell; and a pharmaceutical* acceptable earner. 

22 A pharmaceutical composition comprising a therapeutically 
25 effective amount of a derivative or analog of a Notch protein, which derivative or 
analog is characterized by me ability ft, when expressed on the surface of a 
first cel.. to bind to a second protein expressed on the surface of a second cel., 
which second protein is selected from the group consisting of a Delta protem and 
a Serrate protein; and a pharmaceutical ly acceptable earner. 

30 
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23. A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding a Notch protein; and a 
pharmaceutically acceptable carrier. 

5 24 A pharmaceutical composition comprising a therapeutically 

effective amount of a nucleic acid encoding a functional active portion of a 
human Notch protein; and a pharmaceutically acceptable earner. 

25 A pharmaceutical composition comprising a therapeutically 
Figure 10 (SEQ ID NO:ll); and a pharmaceutically acceptable earner. 

26 A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding the amino acid sequence depicted » 

15 Figure 11 (SEQ ID NO:13); and a pharmaceutically acceptable earner. 

27 A pharmaceutical composition comprising a therapeutically 

fragment is characterized by the ability * «ro. when expressed on une surface of 
20 a first cel., to bind to a Delta protein expressed on the surface of a second cel., 
and a pharmaceutically acceptable carrier. 

28 A pharmaceutical composition comprising a therapeutically 
effective amount of a nucleic acid encoding a chimeric protein, said chimeric 
25 protein comprising a functionally active fragment of a human Notch protem ^ 
via a peptide bond to a protein sequence of a protein different from the Notch 
protein; and a pharmaceutically acceptable carrier. 

29 A pharmaceutical composition comprising a therapeutically 
30 effective amount of a nucleic acid encoding a fragment of a Delta protein which 
fragment is characterized by the ability in vitro, when expressed on the surface of 
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and a pharmaceutical* acceptable earner. 

30 A pharmaceutical condition cpH* a *e,ap«u«l, 

and a pharmaceutically acceptable earner. 

<■ 94 in which the nucleic acid is a 
Q 31. The composition of claim 24 in wmc 

nucleic acid vector. 

32 Ap^aceu.ic.uompo-Uioncompn^a^apeoticany 

Active an*,un . of an .** -** ** »' ™* ' 

15 pharmaceutically acceptable carrier. 

33 A pharmaceutical composition comprising a therapeutical,, 

20 # - „ „ ^icMse or disorder in a 

34 A method of treating or preventmg a disease 

prevention a therapeutically effective amount of a molecu 

function of a Notch protein. 

25 ioi m -u in which the disease or 

35 The method according to claim 34 in wmc 

disorderisama^^^ 

session of a Notch protein or of a N - ^ ^ 

an anti-Notch antibody, relative to said Notch activ.ty 
30 analogous non-malignant sample. 
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36. The method according to claim 34 in which the disease or 
disorder is cervical cancer. 

37. The method according to claim 34 in which the disease or 
5 disorder is breast cancer. 

38 . The method according to claim 34 in which the disease or 
disorder is colon cancer. 

39 The method according to claim 35 in which the malignancy is 
selected from *e group consisting of melanoma, seminoma, and lung cancer. 

40. The method according to claim 35 in which the subject is a 

human. 

4, n, e method according to claim 36, 37 or 38 m which 
moiec* is an antibody to Notch or a portion * said antibody containing ft. 

binding domain thereof. 

42 The method according to claim 36, 37 or 38 in which the 
m0 lecu.e is a protein consisting of at least the extracellular domain of a Notch 
protein or a portion thereof capable of binding to a Notch ligand. 

43. The method according to claim 36, 37 or 38 in which the 

• „<;«tina of at least the EGF homologous repeats of a 
25 molecule is a protein consisting of at least we 

Notch protein. 

44 the method according to claim 36, 37 or 38 in which the 

30 protein. 
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45 The method according to claim 36, 37 or 38 in which the 
molecuie is an oligonucleotide which (a) consists of at least six nucleotides; (b) 
comprises a sequence complementary to at least a portion of an RNA transcript of 
a Notch gene; and (c) is hybridizable to the RNA transcr.pt. 

46 A method of treating or preventing a disease or disorder in a 
subject in need of such treatment or prevention comprising administering to the 
subject a therapeutical effective amount of a molecule which promotes the 
function of a Notch protein. 



10 



15 



47 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a Notch protein. 

48 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a functionally active portion of a Notch protem. 

49. The method according to claim 47 in which the Notch protein 
20 is a human Notch protein. 

50 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a chimeric protein, said protein comprising a functionally 
25 active portion of a Notch protein joined via a peptide bond to a protein sequence 
of a protein different from the Notch protein. 

51. The method according to claim 49 in which the human Notch 
protein comprises the amino acid sequence depicted in Figure 10 (SEQ ID 
30 NO: 11) or Figure 11 (SEQ ID NO: 13). 
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52 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
ZL lount of a derivative or analog 0, a Notch protein, wh.ch denvattve 0 
lg ,s character by me a,,,,, „ Wben pressed on me surface f a 
r , [St L ,o bind to a second protein expressed on me surface of a second cell. 
!L second protein ,s selected from - group consisting of a Del,, protem and 
a Serrate protein. 

53 A method of treating or preventing a malignancy in a subject 
comprising administering ,0 a subject in need of such treatment or p— 
efflive amount o, a derivative or analog of a Delta protem, v*,ch 
liog ,s characterized by me ability * — - - -*» " • 

« L, to bind to a Notch protein expressed on the surface of a second cell. 

54 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
He Lu„ t of a derivative or analog of a Serrate protein, whi* der.va.tve 
ollog is character,*, b, the ability * when expressed on the surface of 
. f,rs, cel., to bind ,0 a Notch protein expressed on the surface of a second ceil. 

55 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of a nucleic acid encoding a Notch protem. 

5 56 A meuwd of treating or preventing a malignancy in a subject 

comprising administering to , subject in need o, such treatment or prevention an 
effective amount of a nucleic acid encoding a functionally active por,K>n 0, a 

Notch protein. 

50 57. The method according to claim 55 in which the subject is 

human and the Notch protein is a human Notch protein. 
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58 A method of trying or preventing a malignancy in a subject 
comprising ***** » • * - ° f °' " 

fragment is characterized by the abiht, ft when «P— » * "J" ' 
a « ce„, >o bind to a second protein express on me surface of a second ceU 

Z ^ *~* - — - *• -* M " ' D * prctt,n 

a Serrate protein. 

59 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
five Lun, of a nucleic acid encoding a fragment o, a Delta protetn, w*ch 
ragment is characterized by the abiUty > when expressed on the su face of 
11, c* to bind to a Notch protein expressed on the surface of a second ceU. 

, 60 A method of treating or preventing a malignancy in a subject 

comprising administering ,0 a subject in need 0, such treatment or P™« ta ™ 
He Lun, of a nucleic acid encoding a fragment of a Serrate protetn w 
fragment is characterized by me ability ft when expressed n the su to of 
a L celt, to bind to a Notch protein expressed on the surface o, a second ce„. 

0 

61 A method of treating or preventing a malignancy in a subject 
comprising administering to a subject in need of such treatment or prevention an 
effective amount of antibody to a Notch protein. 

25 62. The method according to claim 58 in which the antibody is 

monoclonaK 

63 A method for treating a patient with a rumor, of a tumor type 
characterized b, expression of a tf* gene, comprising admirer ing to the 
3. pai e« an effective amount of an oligonucleotide, which « 
consists o, a. leas, six nucleotides; (b) comprises a sequence complement 
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, e as, a portion of an RNA of - «* gene; and ,c, b ***** » 

the RNA transcript. 

64. The method according to claim 60 in which the patient is a 
5 human, and the Notch gene is a human gene. 

65 An isolated oligonucleotide consisting of at least six 
nucleotides, and comprising a seance complementary to at least a port™ of an 
RNA transcript of a W* " " 

10 RNA transcript. 

66. A pharmaceutical composition comprising the oligonucleotide 
of claim 65; and a pharmaceutical!, acceptable carrier. 

67 A method of inhibiting the expression of a nucleic acid 
seuuence encoding a Notch protein in a eel, comprising providing the ce„ with an 
effective amount of the oligonucleotide of clatm 65. 

68 A method of diagnosing a disease or disorder characterised by 
an aberrant level of Notch protein or activity in a patient, 
me ,evel of Notch protein expression or activity in a sample denved from me 
padent, in which an increase or decrease in Notch protein o, activity . the 
patient sample relative . the level found in such a sample from a norma, 
individual indices the presence of the disease or disorder in the pattern. 

69 A method of diagnosing a malignancy characterised by an 
increased amou, of a Notch protein or of a Notch derivative capable of being 
hound by an anti-Notch amibody, comprising measuring the amount », a Notch 
protein or of a Notch derivative capable of being bound by an anti-Notch 
Ily, ,„ a samp, containing o, suspected of containing mahgnan, «* a 
parent, in which an increase in the amount of the Notch prote,n or of me Notch 
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derivative capable of being bound by an anti-Notch antibody, in the sample, 
relative to said amount found in an analogous sample of non-malignant cells 
indicates the presence of the disease or disorder in the patient. 



5 



10 



70. The method according to claim 69 in which the malignancy is 
cervical cancer. 

71. The method according to claim 69 in which the malignancy is 

breast cancer. 

72. The method according to claim 69 in which the malignancy is 

colon cancer. 

73. The method according to claim 69 in which the malignancy is 
15 selected from the group consisting of melanoma, seminoma, and lung cancer. 

74. The method according to claim 69 in which the amount of the 
Notch protein or derivative is measured by a method comprising contacting the 
sample with an anti-Notch antibody such that immunospecific binding can occur, 

20 and measuring the amount of any immunospecific binding of the antibody that 
occurs. 

75. A method of treating or preventing a nervous system disorder 
in a subject comprising administering to a subject in need of such treatment or 

25 prevention an effective amount of a functionally active portion of a Notch protein. 

76. A method of promoting tissue regeneration or repair in a 
subject comprising administering to a subject an effective amount of a functionally 
active portion of a Notch protein. 

30 
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77. A method of treating a benign dysproliferative disorder in a 
subject comprising administering to a subject in need of such treatment an 
effective amount of a functionally active portion of a Notch protein, in winch the 
disorder is selected from the group consisting of cirrhosis of the liver, psonasts, 

5 keloids, and baldness. 

78. A substantially purified human Notch protein comprising the 
amino acid sequence encoded by the hN homolog as depicted in Figure 13 from 
amino acid numbers 1 through 2169 (SEQ ID NO:19). 

10 . 

79. A substantially purified human Notch protein compnsmg the 

amino acid sequence encoded by the hN homolog as depicted in Figure 13 from 
amino acid numbers about 26 through 2169 (as contained in SEQ ID NO:19). 

15 80. A substantially purified protein comprising the extracellular 

domain of the mature human Notch protein encoded by the hN homolog, as 
depicted in Figure 13 from amino acid numbers about 26 through 1677 (as 
contained in SEQ ID NO: 19). 

20 81 . A substantially purified protein comprising the EGF 

homologous repeats of the mature human Notch protein encoded by the hN 
homolog, as depicted in Figure 13 from amino acid numbers 26 through 1413 (as 
contained in SEQ ID NO: 19). 

25 82. A substantially purified protein comprising the EGF like 

repeats 1 1 and 12 of the mature human Notch protein encoded by the hN 
homolog, as depicted in Figure 13 (as contained in SEQ ID NO:19). 

83. A substantially purified protein consisting essentially of the 
30 extracellular domain of the mature human Notch protein encoded by the hN 
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homolog, as depicted in Figure 13 from amino acid numbers about 26 through 
1677 (as contained in SEQ ID NO: 19). 

84. A substantially purified nucleic acid encoding the protein of 

5 claim 78. 

85. A substantially purified nucleic acid encoding the protein of 

claim 79. 

10 86. A substantially purified nucleic acid encoding the protein of 

claim 80. 

87. A substantially purified nucleic acid encoding the protein of 

claim 82. 

15 gg The nucleic acid of claim 85 which is a DNA molecule 

comprising the sequence depicted in Figure 17 from nucleotide numbers 82 
through 7419 (as contained in SEQ ID NO:21). 

20 89. The nucleic acid of claim 80 in which the sequence encoding 

the extracellular domain is as presented in Figure 17 (as contained in 
SEQ ID N0.21). 

90. A recombinant cell containing the nucleic acid of claim 84, 87 

25 or 88. 

91 The composition of claim 2 in which the Notch protein 
comprises the amino acid sequence encoded by the hN homolog as depicted in 
Figure 13 from amino acid numbers 26 through 2169 (as contained in 
30 SEQ ID NO: 19). 
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92. A composition comprising a therapeutically effective amount 
of a Notch protein or Notch derivative, said derivative being capable of being 
bound by an anti-Notch antibody, for use as a medicament. 

5 93. A composition comprising a therapeutically effective amount 

of a molecule which antagonizes the function of a Notch protein, for use as a 
medicament. 

94. Use of a composition comprising a molecule which 
10 antagonizes the function of a Notch protein, for the manufacture of a medicament 
for the treatment of cervical cancer, breast cancer, or colon cancer. 
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GAATTCGGAG GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTAG TTGCCGCACA 60 

CACACACACA CACAGCCCGT GGATTATIAC ACTAAAAGCG ACACTCAATC CAAAAAATCA 120 

GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 

Met His Trp He Lys Cys Leu Leu Thr Ala 
1 5 10 



AAC AAG GGC TTC ACG AA1 ILL AIL Lfib ml ux ml ..^ ..n . 
Asn Lys Gly Phe Thr Asn Pro lie Gin Phe Pro Phe Ser Phe Ser Trp 

115 



171 



TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 
Phe He Cys Phe Thr Vol He Vol Gin Vol His Ser Ser Gly Ser Phe 
15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 
30 35 ^ 

GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 
45 50 5b 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Vol Cys Leu Lys H.s Tyr Gin Ala 
60 65 70 

ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr He Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Vol lie Thr Pro 
75 80 85 ™ 

ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459 
He Leu Gly Glu Asn Ser Vol Asn Leu Thr Asp Ala Gin Arg Phe Gin 
95 100 1° 5 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Phe 
110 
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CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 555 



Pro Gly Thr Phe Ser Leu He Vat Glu Ala Trp His Asp Thr Asn Asn 
1 - 130 I 35 



125 



603 



AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC A C CAG CGA TC TG 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu lie Gin Arg Leu Leu Vol 
140 I 45 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser 
155 160 ^ 1/U 



TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 
175 180 18b 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro rg Asp Asp 

190 195 <-Uu 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu lie He Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 
Thr Gly Trp Gin Gly Asp Tyr Cys His He Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 240 245 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 
255 260 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 987 

]W His Gly Thr Cys Asn Lys Pro Trp Thr Cys He Cys Asn Glu Gly 
270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 1035 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn H,s 
285 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 1083 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 1131 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 JJU 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 1179 
Glu He Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn G y Gly 
335 340 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 1227 
Thr Cys lie Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cys His Cys 
1 350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACQ TGT 1275 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Vol Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 1323 
Ser Asp Lys Pro Cys His Gin Gly He Cys Arg Asn Val Arg Pro Gly 
380 385 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 1371 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro lie Gly Tyr 
395 400 405 410 
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AGC GGA CCC AAC TGC GAT CTC CAG CTG GAC AAC TCC AGT CCG AAT CCA 
Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 
415 420 4 " 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 
Cys lie Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys lie Cys Pro 
430 435 ^ u 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr As. lie sp Asp Cys Leu 
445 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Val Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 

475 480 4oj 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 
us Val Asp Leu Cys Uu lie Arg Pro Cys Ala Asn Gly Gly hr Cys 

495 500 DUJ 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 
510 515 ^ 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 
Ny Lys Asp Cys Ser Val Asp He Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 ^ 

CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 
His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 
540 545 550 
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TCG GTG ACC TTC GAT GUI LAL iaa iri uuk w.u muu - n ^ u - 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 

580 585 

GCC GAT GGT TTG ACC AAI Utl iftb urn GTC CTA ATT OCT GTT TTC 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu He Ala Va Phe Ser 
— 595 &00 
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TGT GCC AAT GGT TTC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
555 560 565 570 

TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899 
Asp 
575 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Leu 
590 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val He Ala Ala Cys Val Val Phe Cys 
605 610 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 &25 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 
655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn He He 
670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lvs Asn Thr Trp Asp Lys Ser Val Asn Asn He Cys Ala Ser Ala Ala 
685 690 695 
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GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 
1 735 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 
750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg lie Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 805 810 

TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 
815 820 »25 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 
830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 
GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG AT GCA 
ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 285 
TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 
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2. ASph 

3. ACI0 

4. AEGF(7-17) 

5. AEGF(9-26) 

6. AEGF(17-30) 

7. AEGF(7-9) 

8. AEGF(9-17) 

9. AEGF(17-26) 
1OAEGF(26-30) 
HAEGF(9-30) 
12AEGF(7-26) 
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1 GAATTCCGCT GGGAGAATGG TCTGAGCTAC 

61 GTGGGGAAAG CCACACTGGG CAAACGGGCC 

121 GCAGGGGGCC CGCAACAGCT GGAGGGCAGG 

181 GGGCAATACT GCCTTTACCC ATGAGCTGGA 

241 GGTGGTTATG TGCTTCCTTC AGGTGGC 



CTGCCCGTCC TGCTGGGGCA TCAATGGCAA 
AGGCCATTTC TGGAATGTGG TACATGGTGG 
TGGACTGAGG CTGGGGATCC CCCGCTGGTT 
AAGTCACAAT GGGGGGCAAG GGCTCCCGAG 



FIG.8A 



1 GAATTCCTTC CATTATACGT GACTTTTCTG 

61 CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC 

121 GCGGGCATGC TCCAGTTTGG TCTGCGTCTC 

181 AACCCGGAAC TGAAGGCTGG CTCTCACCCT 

241 TGTTAGATGT GAATGTCCGT GGCCCAGATG 

301 GAGGAGGCAG CTCAGATTTG AGTGATGAAG 

361 TCATCACAGA CTTGGTCTAC CAGGGTGCCA 

421 GATGGCCCTG CACCTTGCAG CCCGCTACTC 

481 TGCAGGTGCA GATGCCAATG CCCAGGACAA 

541 GGCACGTGAT GCCAAGGTGT ATTCAGATCT 



AAACTGTAGC CACCCTAGTG TCICTAACTC 
AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 
AAGATCACCT TTGGTAATTG ATTCTTCTTC 
CTAGGCAGAG CAGGAATTCC GAGGTGGATG 
GCTGCACCCC ATTGATGTTG GCTTCTCTCC 
ATGAAGATGC AGAGGACTGT TCTGCTAACA 
GCCTCCAGAC CAGACAGACC GGACTGGTGA 
ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 
CATGGGCCGC TGTCCACTCC ATGCTGCAGT 
GTTA 



FIG.8B 



1 TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 

61 CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 

121 GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 

181 CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 

241 AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 



FIG.8C 
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1 GAATTCCATT CAGGAGGAAA GGGTGGGGAG 
61 ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT 
121 TTCAGAAAAT TCCAGAAAAG CCCTACCCCA 
181 GCAACTGGCA CACAAACAGC CAGCGTGTCT 
241 CAGAGCTG 

FIG.9A 



AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 
GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 
ACTCGGACGG CAACGTCACA CCCGTGGGIA 
GGGGCACGGG GGGATGGCAC CCCCTGCAGG 



1 CTAAAGGGAA CAAAAGCNGG AGCTCCACCG 

61 NCCCGGGCTG CAGGAATTCC GGCGGACTGG 

121 ATTCTAGACC GGGAGAACAA GCGAATGGCT 

181 AATCGCTCGC CCTGGACATC GAGGGATGCA 

241 ACTCGGATTT ACAAGCATGA CCAGCCTGCT 

301 CGACAGACAC GAGCTCTATG CAT 



CGGGCGGCNC NGCTCTAGAA CTAGTGGANN 
GCTCGGGCTC AGAGCGGCGC TGTGGAAGAG 
GACAGCTGGC CTCCAAAGTC ACCAGGCTCA 
GAGGATCAGA ACCGGTACCT GGATGGCATG 
TACAGGGAGC GTGANNTTTT CACATGCAGT 



FIG.9B 
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G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 
Glu Vol Asp Vol Leu Asp Vol Asn Vol Arg Gly Pro Asp Gly Cys 
15 10 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 
20 25 50 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn He He Thr Asp 
35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 
Leu Vol Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arg Thr Gly 
50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 
Glu Met Alo Leu His Leu Alo Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Alo Asn Ala Gin Asp Asn Met 
80 85 90 ^ 



GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 
Glv Arq Cys Pro Leu His Ala Ala Vol Alo Ala Asp Ala Gin Gly Val 
y y 100 105 HO 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 
Phe Gin lie Leu He Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 
115 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 
Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu 
130 135 HO 
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GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT OCA G 
Gl Met Val Ala Glu Leu He Asn Cys Gin Ala sp Val Asn Ala Val 
y 145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TOG GCA GCT GCT GTC AAT AAT 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
160 165 

nr. GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 
M It Ma £ Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg sp Met 
180 185 1 

TAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala^Arg Glu Gly 

AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 
f er T r Z Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Arg Asp 

210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 
He S Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 

225 230 235 

AT r tap tat hat ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 
25 SS "1 % He SS £S L I* Asp 0, V As, HI l*r Pro 
240 245 " u 

5SKSSS2SSE2SS5S5 

260 265 c« 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lyn H.s Thr Pro Met biy Lys 
275 280 ^ 
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AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 

Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
1 290 295 300 

AAC CT1 GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 
Ser EIS Ser iTu Lys VI Gin Leu Ser Glu Ser Ser Val Thr Leu Ser 
320 325 330 ^ 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 
Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 
340 345 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 
Ser Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 

355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val is Ala Gin H.s 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 
Ma Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
■ 385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
Ala Ser Thr Val Leu Pro Ser Val Ser Gin Leu Leu Ser H.s H.s H.s 
400 405 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 
He Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu H.s 

420 425 

CCA GTC CCA G1C CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 
Pro Val Pro Val Pro Ala Asp Trp Net Asa Arg Met Glu Val Asa Glu 
435 4411 



FIG.11C 
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ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 
Thr Gin Tyr Asn Glu Met Phe Gly Met Val Leu Ala Pro Ala Glu Gly 
450 455 460 

ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 1438 
Thr His Pro Gly He Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 

ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 1486 
He Thr Thr Pro Arg Glu Pro Leu Pro Pro lie Val Thr Phe Gin Leu 
480 485 490 495 

ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 
He Pro Lys Gly Ser He Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin 
500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gin 
515 520 525 

ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 1630 
He Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met 
530 535 540 

ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678 
Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr He Leu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 
His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gin 
560 565 570 575 

CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 1774 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 
580 585 590 

GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 
1 595 600 605 
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CCT GAC CAG TGG TCA AGT TCA ICA CCC CAC TCT GCI TCT GAC TGG TCA 1870 

Pro Asp Gin Trp Ser Ser Sen Ser Pro His Ser Ala Ser Asp Trp Ser 
610 615 620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 
Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 WO 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 
Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr Ala 

GCTGAGGAAC AAATGAAGGT CATCCGGGAG AGAAATGAAG AAATCTCTGG AGCCAGCTTC 2082 

TAGAGGTAGG AAAGAGAAGA TGTTCTTATT CAGATAATGC AAGAGAAGCA ATTCGTCAGI 2142 

TTCACTGGGT ATC1GCAAGG CTTATTGATT ATTCTAATCT AATAAGACAA GT1TG1GGAA 3202 

ATGCAAGATG AATACAAGCC ITGGGTCtAl GT1TACTCTC TTCTATTTGG AGAATAAGAT 2262 

GGATGC1TA1 TGAAGCCCAG ACAUCTTGC AGCT1GGACT GCATTTIAAG CCCIGCAGGC 2322 

TTCTGCCATA TCCATGAGAA GATTCJACAC TAGCGTCCTG TTGGGAAITA TGCCC1GGAA 2382 

TTCTGCCTGA ATTGACC1AC GCATCTCCTC CTCCTTGGAC AITCTTTTGT CTTCATTTGG 2442 

TGCTTTTGGT T1TGCACCTC TCCGTGATTG TAGCCC7ACC AGCATGTTAI AGGGCAAGAC 2502 

CTTTGTGCTT TTGATCATTC TGGCCCATGA AAGCAACTT1 GGTCTCCTTT CCCCTCCTGT 2562 

CUCCCGGTA TCCCTTGGAG 1CTCACAAGG TTTACTTIGG TATGG1TCTC AGCACAAACC 2622 

TTTCAAGTAT GTTGTTTCTT 1GGAAAATGG ACATACTGTA TTGTGTTCTC CTGCATMAT 2682 

CA11CCTGGA GAGAGAAGGG GAGAAGAAIA CTTTTCTTCA ACAAATTTTG GGGGCAGGAG 2742 
A1CCCTTCAA GAGGCTGCAC CTWMim CTTGTCTGTG TGCAGGTC1T CATA1AAACT 2802 

FIG.11E 
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TTACCAGGftA GAftGGGTGTC AGTT1GTTGT TTT1CTG1GT ATGGGCCTG6 TCAGTGTAAA 2862 

GTTTTATCCT TGMAGTCIA GTTACTATGA CCCICCCCAC TTTTTTMAA CCAGAAAAAG 2922 

GTTTGGAATG T1GGAATGAC CAAGAGACAA GTTAACTCGT GCAAGAGCCA GTTACCCACC 298? 

CACAGGTCCC CCTACTTCCT GCCAAGCATT CCATIGACTG CC1GTATGGA ACACAT11GT 3042 

CCCAGATCTG AGCATTCTAG GCCTGTTTCA CTCACTCACC CAGCATATGA AACTAGTCTT 3! 02 

AACTGTTGAG CCTCTCCTT1 CATATCCACA GAAGACAC1G TCTCAAATGT TGTACCCTTG 3162 

CCATTTAGGA CTGAACTTIC CT1AGCCCAA GGGACCCAGT GACAGTTG1C TTCCGTTTGT 3222 

CAGATGATCA GTCTCTACTG ATTATCTTGC TGCTTAAAGG CCTGCTCACC AAICTTTCT1 3282 
TCACACCGTG TGGICCGTGT TACTGGTATA CCCAGTATGT TCTCACTGAA GACATGGACT 3342 
TTATATGTTC AAGTGCAGGA ATTGGAAAGT 1GGACIIGTT TTCTATGATC CAAAACAGCC 3402 
CIATAAGAAG GTTGGAAAAG GAGGAACTAI ATAGCAGCCT TTGCTAT1TT CTGCTACCAT 3462 
TTCTTITCCT CTGAAGCGGC CA1GACATIC CCT7TGGCAA CTAACGTAGA AACTCAACAG 3522 



FIG.11F 
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AACATTTTCC TTTCCTAGAG 7CACCTTT7A GATGATAATG GACAACTATA GACTTGCTCA 3582 

TTGTTCAGAC TGATTGCCCC TCACCTGAAT CCACTCTCTG TATTCATGCT CTTGGCAATT 3642 

TCTTTGACTT TCTTTTAAGG GCAGAAGCAT TTTAGTTAA1 TGTAGATAAA GAATAGTTTT 3702 

CTTCCTCTTC TCCTTGGGCC AGTTAATAAT TGGTCCATGG CTACACTGCA ACTTCCGTCC 3762 

AGTGCTGTGA TGCCCATGAC ACCTGCAAAA TAAGTTCTGC CTGGGCATTT TGTAGATATT 3822 

AACAGGTGAA TTCCCGACTC TTTTGGTTTG AATGACAGTT CTCATTCCTT CTATGGCTGC 3882 

AAGTATGCAT CAGTGCTTCC CACTTACCTG ATTTGTCTGT CGGTGGCCCC ATATGGAAAC 3942 

CCTGCGTGTC TGTTGGCATA ATAGTTTACA AATGG1TTTT TCAGTCCTAT CCAAATTTAT 4002 

TGAACCAACA AAAATAATTA CTTCTGCCCT GAGATAAGCA GATTAAGTTT GTTCATTCTC 4062 

TGCTTTATTC TCICCATGTG GCAACATTCT GTCAGCCTCT TTCATAGTGT GCAAACATTT 4122 

TATCATTCTA AATGGTGACI CTCTGCCCTT GGACCCATTT ATTATTCACA GATGGGGAGA 4182 

ACCTATCTGC ATGGACCCTC ACCATCCTCT GTGCAGCACA CACAGTGCAG GGAGCCAGTG 4242 

GCGATGGCGA TGACTTTCTT CCCCTG 4268 



FIG.11G 
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